2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,635 papers shown
Title
FlipConcept: Tuning-Free Multi-Concept Personalization for Text-to-Image Generation
Young Beom Woo
Sun Eung Kim
DiffM
48
0
0
24 Feb 2025
Can Large Vision-Language Models Detect Images Copyright Infringement from GenAI?
Qipan Xu
Zihan Wang
Xiaoxiao He
Ligong Han
Ruixiang Tang
41
0
0
23 Feb 2025
DP-Adapter: Dual-Pathway Adapter for Boosting Fidelity and Text Consistency in Customizable Human Image Generation
Ye Wang
Xuping Xie
Lanjun Wang
Zili Yi
Rui Ma
DiffM
97
0
0
21 Feb 2025
Multi-Agent Multimodal Models for Multicultural Text to Image Generation
Parth Bhalerao
Mounika Yalamarty
Brian Trinh
Oana Ignat
42
0
0
21 Feb 2025
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers
D. She
Mushui Liu
Jingxuan Pang
Jin Wang
Zhen Yang
...
Yi Wang
Qihan Huang
Haobin Tang
YunLong Yu
Siming Fu
VGen
96
4
0
21 Feb 2025
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
Lijun Li
Zhelun Shi
Xuhao Hu
Bowen Dong
Yiran Qin
Xihui Liu
Lu Sheng
Jing Shao
114
1
0
21 Feb 2025
Image compositing is all you need for data augmentation
Ang Jia Ning Shermaine
Michalis Lazarou
Tania Stathaki
62
1
0
20 Feb 2025
Precise Parameter Localization for Textual Generation in Diffusion Models
Łukasz Staniszewski
Bartosz Cywiński
Franziska Boenisch
Kamil Deja
Adam Dziedzic
DiffM
226
0
0
17 Feb 2025
Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images
Negar Kamali
Karyn Nakamura
Aakriti Kumar
Angelos Chatzimparmpas
Jessica Hullman
Matthew Groh
61
0
0
17 Feb 2025
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
Gyumin Shim
Sangmin Lee
Jaegul Choo
3DGS
66
0
0
17 Feb 2025
Diffusion Models without Classifier-free Guidance
Zhicong Tang
Jianmin Bao
Dong Chen
Baining Guo
VLM
60
2
0
17 Feb 2025
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Theodoros Kouzelis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
DRL
80
6
0
17 Feb 2025
FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion
Yufan Zhou
Haoyu Shen
Huan Wang
DiffM
113
0
0
17 Feb 2025
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
Ye Tian
L. Yang
Xinchen Zhang
Yunhai Tong
Mengdi Wang
Bin Cui
67
2
0
17 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffM
VGen
73
3
0
17 Feb 2025
Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Zahra Bayramli
Ayhan Suleymanzade
Na Min An
Huzama Ahmad
Eunsu Kim
Junyeong Park
James Thorne
Alice Oh
91
0
0
13 Feb 2025
Learning Human Skill Generators at Key-Step Levels
Yilu Wu
Chenhui Zhu
Shuai Wang
Hanlin Wang
Jing Wang
Zhaoxiang Zhang
Limin Wang
VGen
121
0
0
12 Feb 2025
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation
H. Seo
Wongi Jeong
Jae-sun Seo
Se Young Chun
62
0
0
12 Feb 2025
MatSwap: Light-aware material transfers in images
Ivan Lopes
Valentin Deschaintre
Yannick Hold-Geoffroy
Raoul de Charette
DiffM
90
0
0
11 Feb 2025
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
122
4
0
11 Feb 2025
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
Weijia Mao
Zhiyong Yang
Mike Zheng Shou
MoE
80
0
0
10 Feb 2025
Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling
Xiao Li
Zekai Zhang
Xiang Li
Siyi Chen
Zhihui Zhu
Peng Wang
Qing Qu
DiffM
53
0
0
09 Feb 2025
Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation
Vera Soboleva
M. Nakhodnov
Aibek Alanov
52
0
0
09 Feb 2025
Training-Free Constrained Generation With Stable Diffusion Models
Stefano Zampini
Jacob K Christopher
Luca Oneto
Davide Anguita
Ferdinando Fioretto
48
1
0
08 Feb 2025
Beyond and Free from Diffusion: Invertible Guided Consistency Training
Chia-Hong Hsu
Shiu-hong Kao
Randall Balestriero
3DV
82
0
0
08 Feb 2025
Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets
Haoye Lu
Qifan Wu
Yaoliang Yu
DiffM
49
1
0
08 Feb 2025
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Minh-Quan Le
Gaurav Mittal
Tianjian Meng
A S M Iftekhar
Vishwas Suryanarayanan
Barun Patra
Dimitris Samaras
Mei Chen
DiffM
67
0
0
07 Feb 2025
FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing
Jinya Sakurai
Issei Sato
76
0
0
06 Feb 2025
Recommendations Beyond Catalogs: Diffusion Models for Personalized Generation
Gabriel Patron
Zhiwei Xu
Ishan Kapnadak
Felipe Maia Polo
DiffM
38
0
0
05 Feb 2025
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach
Yunuo Chen
Junli Cao
Anil Kag
Vidit Goel
Sergei Korolev
Chenfanfu Jiang
Sergey Tulyakov
Jian Ren
DiffM
VGen
95
1
0
05 Feb 2025
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
Jiyang Li
Jingyun Liang
Yong Guo
W. J. Li
Yulun Zhang
DiffM
75
0
0
04 Feb 2025
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
Rohit Gandikota
Zongze Wu
Richard Zhang
David Bau
Eli Shechtman
Nick Kolkin
DiffM
53
1
0
03 Feb 2025
HuViDPO: Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
71
1
0
02 Feb 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
78
7
0
28 Jan 2025
Can Pose Transfer Models Generate Realistic Human Motion?
Vaclav Knapp
Matyas Bohacek
189
0
0
28 Jan 2025
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng
Weikang Qiu
Jinbin Bai
Xiao Zhang
Zhen Dong
Kaicheng Zhou
Rex Ying
Leandros Tassiulas
DiffM
63
6
0
28 Jan 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
38
0
0
28 Jan 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Chufan Chen
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
72
2
0
28 Jan 2025
Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Yunbo Lyu
Zhou Yang
Yuqing Niu
Jing Jiang
David Lo
44
1
0
28 Jan 2025
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
Tim Broedermann
Daniel Gehrig
Yuqian Fu
Luc Van Gool
62
7
0
28 Jan 2025
Sparse High Rank Adapters
K. Bhardwaj
N. Pandey
Sweta Priyadarshi
Viswanath Ganapathy
Rafael Esteves
...
P. Whatmough
Risheek Garrepalli
M. V. Baalen
Harris Teague
Markus Nagel
MQ
43
4
0
28 Jan 2025
PAID: A Framework of Product-Centric Advertising Image Design
Hongyu Chen
Min Zhou
Jing Jiang
Jiale Chen
Yang Lu
Bo Xiao
T. Ge
Bo Zheng
DiffM
VLM
43
0
0
24 Jan 2025
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
Andrey Palaev
Adil Mehmood Khan
S. M. Ahsan Kazmi
DiffM
53
0
0
23 Jan 2025
PreciseCam: Precise Camera Control for Text-to-Image Generation
Edurne Bernal-Berdun
Ana Serrano
B. Masiá
Matheus Gadelha
Yannick Hold-Geoffroy
Xin Sun
Diego F. F. Gutierrez
DiffM
VGen
56
0
0
22 Jan 2025
Accelerate High-Quality Diffusion Models with Inner Loop Feedback
M. Gwilliam
Han Cai
Di Wu
Abhinav Shrivastava
Zhiyu Cheng
95
0
0
22 Jan 2025
Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement
Christoph Gebhardt
Robin Willardt
Seyedmorteza Sadat
Chih-Wei Ning
Andreas Brombach
Jie Song
Otmar Hilliges
Christian Holz
67
0
0
21 Jan 2025
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space
Daniel Garibi
Shahar Yadin
Roni Paiss
Omer Tov
Shiran Zada
Ariel Ephrat
T. Michaeli
Inbar Mosseri
Tali Dekel
DiffM
106
2
0
21 Jan 2025
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
Yiyang Wang
Xi Chen
Xiaogang Xu
S. Ji
Y. Liu
Yujun Shen
Hengshuang Zhao
DiffM
51
0
0
21 Jan 2025
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Zibo Zhao
Zeqiang Lai
Qingxiang Lin
Yunfei Zhao
Haolin Liu
...
Jie Jiang
Jingwei Huang
Chunchao Guo
113
25
0
21 Jan 2025
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
Ruojun Xu
Weijie Xi
Xiaodi Wang
Yongbo Mao
Zach Cheng
DiffM
39
1
0
20 Jan 2025