2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter
Peng-Fei Xing
Ning Wang
Jianbo Ouyang
Zechao Li
DiffM
70
1
0
05 Jun 2024
Controllable Talking Face Generation by Implicit Facial Keypoints Editing
Dong Zhao
Jiaying Shi
Wenjun Li
Shudong Wang
Shenghui Xu
Zhaoming Pan
CVBM
83
0
0
05 Jun 2024
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Hao Wen
Zehuan Huang
Yaohui Wang
Xinyuan Chen
Yu Qiao
159
9
0
05 Jun 2024
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning
Man Liu
H. Bai
Feng Li
Chunjie Zhang
Yunchao Wei
Meng Wang
Tat-Seng Chua
VLM
128
0
0
05 Jun 2024
V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation
Cong Wang
Kuan Tian
Jun Zhang
Yonghang Guan
Feng Luo
Fei Shen
Zhiwei Jiang
Qing Gu
Xiao Han
Wei Yang
127
45
0
04 Jun 2024
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
Dejia Xu
Weili Nie
Chao Liu
Sifei Liu
Jan Kautz
Zhangyang Wang
Arash Vahdat
DiffM
VGen
134
59
0
04 Jun 2024
Guiding a Diffusion Model with a Bad Version of Itself
Tero Karras
M. Aittala
Tuomas Kynkaanniemi
J. Lehtinen
Timo Aila
S. Laine
127
90
0
04 Jun 2024
Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation
Jiajun Wang
Morteza Ghahremani
Yitong Li
Björn Ommer
Christian Wachinger
DiffM
50
2
0
04 Jun 2024
RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting
Qi Wang
Ruijie Lu
Xudong Xu
Jingbo Wang
Michael Yu Wang
Bo Dai
Gang Zeng
Dan Xu
DiffM
88
6
0
04 Jun 2024
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Clement Chadebec
O. Tasar
Eyal Benaroche
Benjamin Aubin
VLM
120
14
0
04 Jun 2024
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
109
4
0
04 Jun 2024
The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise
Yuanhao Ban
Ruochen Wang
Tianyi Zhou
Boqing Gong
Cho-Jui Hsieh
Minhao Cheng
DiffM
108
6
0
04 Jun 2024
Plug-and-Play Diffusion Distillation
Yi-Ting Hsiao
Siavash Khodadadeh
Kevin Duarte
Wei-An Lin
Hui Qu
Mingi Kwon
Ratheesh Kalarot
110
9
0
04 Jun 2024
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Yi Ma
Hongyu Liu
Haobo Wang
Heng Pan
Yingqing He
...
Ailing Zeng
Chengfei Cai
H. Shum
Wen Liu
Qifeng Chen
130
61
0
04 Jun 2024
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Inkyu Shin
Qihang Yu
Xiaohui Shen
In So Kweon
Kuk-Jin Yoon
Liang-Chieh Chen
VGen
DiffM
125
1
0
04 Jun 2024
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Kengo Uchida
Takashi Shibuya
Yuhta Takida
Naoki Murata
Shusuke Takahashi
Yuki Mitsufuji
VGen
147
5
0
04 Jun 2024
L-MAGIC: Language Model Assisted Generation of Images with Coherence
Zhipeng Cai
Matthias Mueller
R. Birkl
Diana Wofk
Shaoyen Tseng
JunDa Cheng
Gabriela Ben-Melech Stan
Vasudev Lal
Michael Paulitsch
DiffM
MLLM
82
6
0
03 Jun 2024
Differentially Private Fine-Tuning of Diffusion Models
Yu-Lin Tsai
Yizhe Li
Zekai Chen
Po-yu Chen
Chia-Mu Yu
Xuebin Ren
Francois Buet-Golfouse
111
3
0
03 Jun 2024
Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Enhui Ma
Lijun Zhou
Tao Tang
Zhan Zhang
Dong Han
...
Peng Jia
Xianpeng Lang
Haiyang Sun
Di Lin
Kaicheng Yu
VGen
114
28
0
03 Jun 2024
FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis
Linshan Wu
Jiaxin Zhuang
Xuefeng Ni
Hao Chen
MedIm
102
13
0
03 Jun 2024
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Xiang Wang
Shiwei Zhang
Changxin Gao
Jiayu Wang
Xiaoqiang Zhou
Yingya Zhang
Luxin Yan
Nong Sang
VGen
141
41
0
03 Jun 2024
Dimba: Transformer-Mamba Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Youqiang Zhang
Junshi Huang
Mamba
105
19
0
03 Jun 2024
Δ-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Pengtao Chen
Mingzhu Shen
Peng Ye
Jianjian Cao
Chongjun Tu
C. Bouganis
Yiren Zhao
Tao Chen
129
44
0
03 Jun 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
DiffM
126
8
0
03 Jun 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao
Yuanbo Yang
Hongyu Zhou
Youmin Zhang
Yujun Shen
Vitor Campagnolo Guizilini
Yue Wang
Matteo Poggi
Yiyi Liao
VGen
DiffM
MDE
123
43
0
03 Jun 2024
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Junhao Cheng
Xi Lu
Hanhui Li
Khun Loun Zai
Baiqiao Yin
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
VGen
130
11
0
03 Jun 2024
Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting
Jincheng Zhong
Xingzhuo Guo
Jiaxiang Dong
Mingsheng Long
DiffM
99
2
0
02 Jun 2024
ContextFlow++: Generalist-Specialist Flow-based Generative Models with Mixed-Variable Context Encoding
Denis A. Gudovskiy
Tomoyuki Okuno
Yohei Nakata
MoE
AI4CE
90
2
0
02 Jun 2024
Improving Text Generation on Images with Synthetic Captions
Jun Young Koh
Sang Hyun Park
Joy Song
DiffM
135
2
0
01 Jun 2024
The Curious Case of End Token: A Zero-Shot Disentangled Image Editing using CLIP
Hidir Yesiltepe
Yusuf Dalva
Pinar Yanardag
DiffM
64
2
0
01 Jun 2024
GenPalm: Contactless Palmprint Generation with Diffusion Models
Steven A. Grosz
Anil K. Jain
79
2
0
01 Jun 2024
Temporally Consistent Object Editing in Videos using Extended Attention
AmirHossein Zamani
Amir G. Aghdam
Tiberiu Popa
Eugene Belilovsky
DiffM
109
1
0
01 Jun 2024
A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inheritance Strategies
Jinchao Zhu
Yuxuan Wang
Siyuan Pan
Pengfei Wan
Di Zhang
Gao Huang
101
0
0
31 May 2024
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Xinxi Zhang
Song Wen
Ligong Han
Felix Juefei Xu
Akash Srivastava
Junzhou Huang
Hao Wang
Molei Tao
Dimitris N. Metaxas
DiffM
74
7
0
31 May 2024
MegActor: Harness the Power of Raw Video for Vivid Portrait Animation
Shurong Yang
Huadong Li
Juhao Wu
Minhao Jing
Linze Li
Renhe Ji
Jiajun Liang
Haoqiang Fan
DiffM
VGen
115
15
0
31 May 2024
Slight Corruption in Pre-training Data Makes Better Diffusion Models
Hao Chen
Yujin Han
Diganta Misra
Xiang Li
Kai Hu
Difan Zou
Masashi Sugiyama
Jindong Wang
Bhiksha Raj
DiffM
125
5
0
30 May 2024
Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Kailu Wu
Fangfu Liu
Zhihan Cai
Runjie Yan
Hanyang Wang
Yating Hu
Yueqi Duan
Kaisheng Ma
113
64
0
30 May 2024
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Shuyuan Tu
Qi Dai
Zihao Zhang
Sicheng Xie
Zhi-Qi Cheng
Chong Luo
Xintong Han
Zuxuan Wu
Yu-Gang Jiang
DiffM
VGen
75
11
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
134
7
0
30 May 2024
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Massimo Bini
Karsten Roth
Zeynep Akata
Anna Khoreva
69
5
0
30 May 2024
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
Muyao Niu
Xiaodong Cun
Xintao Wang
Yong Zhang
Ying Shan
Yinqiang Zheng
DiffM
VGen
112
43
0
30 May 2024
MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models
Lukas Uzolas
E. Eisemann
Petr Kellnhofer
113
1
0
30 May 2024
Applications of Generative AI (GAI) for Mobile and Wireless Networking: A Survey
Thai-Hoc Vu
Senthil Kumar Jagatheesaperumal
Minh-Duong Nguyen
Nguyen Van Huynh
Sunghwan Kim
Quoc-Viet Pham
94
13
0
30 May 2024
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Fangyi Chen
Han Zhang
Zhantao Yang
Hao Chen
Kai Hu
Marios Savvides
ObjD
VLM
86
5
0
30 May 2024
HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization
Wenxuan Liu
Saiqian Zhang
MQ
81
5
0
30 May 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
Haoxing Chen
Yan Hong
Zizheng Huang
Zhuoer Xu
Zhangxuan Gu
...
Jun Lan
Huijia Zhu
Jianfu Zhang
Weiqiang Wang
Huaxiong Li
Mamba
166
21
0
30 May 2024
Creating Language-driven Spatial Variations of Icon Images
Xianghao Xu
Aditya Ganeshan
K. Willis
Yewen Pu
Daniel E. Ritchie
78
0
0
30 May 2024
Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion
Jiangkai Wu
Liming Liu
Yunpeng Tan
Junlin Hao
Xinggong Zhang
146
3
0
30 May 2024
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Nicolas Dufour
Victor Besnier
Vicky Kalogeiton
David Picard
DiffM
133
2
0
30 May 2024
X-VILA: Cross-Modality Alignment for Large Language Model
Hanrong Ye
De-An Huang
Yao Lu
Zhiding Yu
Ming-Yu Liu
...
Jan Kautz
Song Han
Dan Xu
Pavlo Molchanov
Hongxu Yin
MLLM
VLM
86
35
0
29 May 2024