Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 367 papers shown
Title
Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects
Wei Li
Hebei Li
Yansong Peng
Siying Wu
Yueyi Zhang
Xiaoyan Sun
DiffM
90
0
0
27 May 2025
Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift
Gihoon Kim
Hyungjin Park
Taesup Kim
DiffM
VLM
151
0
0
26 May 2025
Absolute Coordinates Make Motion Generation Easy
Zichong Meng
Zeyu Han
Xiaogang Peng
Yiming Xie
Huaizu Jiang
161
0
0
26 May 2025
What Can RL Bring to VLA Generalization? An Empirical Study
Jijia Liu
Feng Gao
Bingwen Wei
Xinlei Chen
Qingmin Liao
Yi Wu
Chao Yu
Yu Wang
OffRL
236
0
0
26 May 2025
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
Yiren Song
Cheng Liu
Mike Zheng Shou
DiffM
170
2
0
24 May 2025
One-Step Diffusion-Based Image Compression with Semantic Distillation
Naifu Xue
Zhaoyang Jia
Jiahao Li
Bin Li
Yuan Zhang
Yan Lu
DiffM
103
0
0
22 May 2025
Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On
Siqi Wan
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
236
1
0
22 May 2025
MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control
Mingqi Shao
Feng Xiong
Zhaoxu Sun
Mu Xu
DiffM
78
0
0
19 May 2025
Guiding Diffusion with Deep Geometric Moments: Balancing Fidelity and Variation
Sangmin Jung
Utkarsh Nath
Yezhou Yang
Giulia Pedrielli
Joydeep Biswas
Amy Zhang
Hassan Ghasemzadeh
Pavan Turaga
DiffM
103
0
0
18 May 2025
DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model
Siwei Xia
Li Sun
Tiantian Sun
Qingli Li
DiffM
89
0
0
18 May 2025
Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling
Rui Qin
Qijie Wang
Ming Sun
Haowei Zhu
Chao Zhou
Bin Wang
123
0
0
17 May 2025
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration
Haipeng Fang
Sheng Tang
Juan Cao
Enshuo Zhang
Fan Tang
Tong-Yee Lee
82
0
0
16 May 2025
One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework
Feiran Li
Qianqian Xu
Shilong Bao
Zhiyong Yang
Xiaochun Cao
Qingming Huang
DiffM
98
0
0
16 May 2025
You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts
Hongkun Dou
Zeyu Li
Xingyu Jiang
Haoyang Li
Lijun Yang
Wen Yao
Yue Deng
DiffM
224
0
0
12 May 2025
Towards Better Cephalometric Landmark Detection with Diffusion Data Generation
Dongqian Guo
Wencheng Han
Pang Lyu
Yuxi Zhou
Jianbing Shen
MedIm
144
0
0
09 May 2025
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation
Mingcheng Li
Xiaolu Hou
Ziyang Liu
Dingkang Yang
Ziyun Qian
Jiawei Chen
Jinjie Wei
Yiheng Jiang
Qingyao Xu
Li Zhang
DiffM
475
0
0
05 May 2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li
Xin Gu
Fan Chen
X. Xing
Longyin Wen
Chong Chen
Sijie Zhu
DiffM
224
1
0
05 May 2025
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation
Volodymyr Havrylov
Haiwen Huang
Dan Zhang
Andreas Geiger
464
0
0
04 May 2025
Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset
Jakub Wąsala
Bartłomiej Wrzalski
Kornelia Noculak
Yuliia Tarasenko
Oliwer Krupa
Jan Kocoń
Grzegorz Chodak
104
0
0
04 May 2025
MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance
Mengting Wei
Yante Li
Tuomas Varanka
Yan Jiang
Guoying Zhao
DiffM
VGen
130
0
0
30 Apr 2025
Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision
Weicai Yan
Wang Lin
Zirun Guo
Ye Wang
Fangming Feng
Xiaoda Yang
Ziyi Wang
Tao Jin
DiffM
280
2
0
30 Apr 2025
Pretraining Large Brain Language Model for Active BCI: Silent Speech
Jinzhao Zhou
Zehong Cao
Yiqun Duan
Connor Barkley
Daniel Leong
...
Ziyi Zhao
T. Do
Yu-Cheng Chang
Sheng-Fu Liang
Chin-Teng Lin
85
1
0
29 Apr 2025
IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos
Yuan Li
Ziqian Bai
Feitong Tan
Zhaopeng Cui
S. Fanello
Yinda Zhang
DiffM
VGen
115
0
0
27 Apr 2025
Step1X-Edit: A Practical Framework for General Image Editing
Shixuan Liu
Yucheng Han
Peng Xing
Fukun Yin
Rui Wang
...
Yibo Zhu
Binxing Jiao
Wei Wei
Gang Yu
Daxin Jiang
DiffM
176
17
0
24 Apr 2025
PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation
Xinqi Xiong
Andrea Dunn Beltran
Jun Myeong Choi
Marc Niethammer
Roni Sengupta
DiffM
MedIm
83
0
0
23 Apr 2025
Mask Image Watermarking
Runyi Hu
Jie Zhang
Shiqian Zhao
Nils Lukas
Jiwei Li
Qing Guo
Han Qiu
Tianwei Zhang
116
1
0
17 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
172
0
0
15 Apr 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
Kevin Xie
Amirmojtaba Sabour
Jiahui Huang
Despoina Paschalidou
G. Klár
Umar Iqbal
Sanja Fidler
Fangyin Wei
VGen
MDE
98
1
0
15 Apr 2025
SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow
Kenan Tang
Yanhong Li
Yao Qin
DiffM
79
0
0
13 Apr 2025
STAGE: Stemmed Accompaniment Generation through Prefix-Based Conditioning
Giorgio Strano
Chiara Ballanti
Donato Crisostomi
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
115
0
0
08 Apr 2025
BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis
Moinak Bhattacharya
Saumya Gupta
Annie Singh
Chong Chen
Gagandeep Singh
Prateek Prasanna
MedIm
108
0
0
06 Apr 2025
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing
Xiangyu Zhao
Peiyuan Zhang
Kexian Tang
Hao Li
Zicheng Zhang
...
Guangtao Zhai
Junchi Yan
Hua Yang
Xue Yang
Haodong Duan
VLM
LRM
123
5
0
03 Apr 2025
SketchVideo: Sketch-based Video Generation and Editing
Feng-Lin Liu
Hongbo Fu
Xintao Wang
Weicai Ye
Pengfei Wan
Di Zhang
Lin Gao
DiffM
VGen
108
0
0
30 Mar 2025
Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization
Barış Batuhan Topal
Umut Özyurt
Zafer Doğan Budak
Ramazan Gokberk Cinbis
85
0
0
28 Mar 2025
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
Yunhong Min
Daehyeon Choi
Kyeongmin Yeo
Jihyun Lee
Minhyuk Sung
94
0
0
28 Mar 2025
CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving
Yishen Ji
Ziyue Zhu
Zhenxin Zhu
Kaixin Xiong
Ming Lu
Zhiqi Li
Lijun Zhou
Haiyang Sun
Bing Wang
Tong Lu
VGen
92
2
0
28 Mar 2025
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
Hyunjun Lee
Hyunsoo Lee
Sookwan Han
DiffM
121
0
0
27 Mar 2025
DynamiCtrl: Rethinking the Basic Structure and the Role of Text for High-quality Human Image Animation
Haoyu Zhao
Zhongang Qi
Cong Wang
Qingping Zheng
Guansong Lu
Fei Chen
Hang Xu
Zuxuan Wu
DiffM
VGen
97
0
0
27 Mar 2025
Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models
Haoming Cai
Tsung-Wei Huang
Shiv Gehlot
Brandon Yushan Feng
Sachin Shah
Guan-Ming Su
Christopher A. Metzler
DiffM
86
0
0
27 Mar 2025
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Prin Phunyaphibarn
Phillip Y. Lee
Jaihoon Kim
Minhyuk Sung
DiffM
151
0
0
26 Mar 2025
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models
Fernando Julio Cendra
Kai Han
VLM
106
0
0
25 Mar 2025
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation
Qiang Qu
Ming Li
Xiaoming Chen
Tongliang Liu
DiffM
VGen
110
1
0
24 Mar 2025
DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation
Massimo Bini
Leander Girrbach
Zeynep Akata
185
1
0
23 Mar 2025
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
Zhou Zhenglin
Ma Fan
Fan Hehe
Chua Tat-Seng
VGen
164
0
0
20 Mar 2025
The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation
Benidir Yanis
Gonthier Nicolas
Mallet Clement
81
1
0
19 Mar 2025
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
Zhixuan Liu
H. Zhu
R. Chen
Jonathan M Francis
Soonmin Hwang
Jiangning Zhang
Jean Oh
VGen
466
0
0
18 Mar 2025
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control
Nvidia
Hassan Abu Alhaija
Jose M. Alvarez
Maciej Bala
Tiffany Cai
...
Yuchong Ye
Xiaodong Yang
Boxin Wang
Fangyin Wei
Yu Zeng
VGen
145
7
0
18 Mar 2025
Advances in 4D Generation: A Survey
Qiaowei Miao
Kehan Li
Jinsheng Quan
Zhiyuan Min
Shaojie Ma
Yichao Xu
Yi Yang
Yawei Luo
142
1
0
18 Mar 2025
From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration
Mingyang Song
Xiaoye Qu
Jiawei Zhou
Yu Cheng
VLM
115
1
0
17 Mar 2025
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation
Daniil Selikhanovych
David Li
Aleksei Leonov
Nikita Gushchin
Sergei Kushneriuk
Alexander N. Filippov
Evgeny Burnaev
Iaroslav Koshelev
Alexander Korotin
DiffM
134
0
0
17 Mar 2025
1
2
3
4
5
6
7
8
Next