Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography
Jiwen Yu
Xuanyu Zhang
You-song Xu
Jian Zhang
DiffM
99
53
0
26 May 2023
ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image
Zhenzhen Weng
Zeyu Wang
S. Yeung
DiffM
51
21
0
25 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
115
268
0
25 May 2023
Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation
Lisa Dunlap
Alyssa Umino
Han Zhang
Jiezhi Yang
Joseph E. Gonzalez
Trevor Darrell
DiffM
91
79
0
25 May 2023
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion
Guangyao Zhai
Evin Pınar Örnek
Shun-cheng Wu
Yan Di
F. Tombari
Nassir Navab
Benjamin Busam
DiffM
139
14
0
25 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
127
61
0
25 May 2023
Robust Category-Level 3D Pose Estimation from Synthetic Data
Jiahao Yang
Wufei Ma
Angtian Wang
Xiaoding Yuan
Alan Yuille
Adam Kortylewski
106
2
0
25 May 2023
DiffusionShield: A Watermark for Copyright Protection against Generative Diffusion Models
Yingqian Cui
Jie Ren
Han Xu
Pengfei He
Hui Liu
Lichao Sun
Yue Xing
Jiliang Tang
WIGM
92
35
0
25 May 2023
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Sitian Shen
Zilin Zhu
Linqian Fan
Harry Zhang
Xinxiao Wu
DiffM
150
28
0
25 May 2023
Differentially Private Latent Diffusion Models
Saiyue Lyu
Michael F. Liu
Margarita Vinaroz
Mijung Park
108
26
0
25 May 2023
Efficient Neural Music Generation
Max W. Y. Lam
Qiao Tian
Tang-Chun Li
Zongyu Yin
Siyuan Feng
...
Mingbo Ma
Xuchen Song
Jitong Chen
Yuping Wang
Yuxuan Wang
DiffM
MGen
95
56
0
25 May 2023
Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps
Mingxiao Li
Tingyu Qu
Ruicong Yao
Wei Sun
Marie-Francine Moens
DiffM
99
42
0
24 May 2023
Unsupervised Semantic Correspondence Using Stable Diffusion
Eric Hedlin
Gopal Sharma
Shweta Mahajan
Hossam N. Isack
Abhishek Kar
Andrea Tagliasacchi
K. M. Yi
DiffM
106
95
0
24 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Xinze Wang
William Yang Wang
MLLM
113
180
0
24 May 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
Junyi Zhang
Charles Herrmann
Junhwa Hur
Luisa Polania Cabrera
Varun Jampani
Deqing Sun
Ming-Hsuan Yang
DiffM
103
188
0
24 May 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Marco Bellagente
Manuel Brack
H. Teufel
Felix Friedrich
Bjorn Deiseroth
...
Koen Oostermeijer
Andres Felipe Cruz Salinas
P. Schramowski
Kristian Kersting
Samuel Weinbach
141
20
0
24 May 2023
L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
Zheng Chang
Shuchen Weng
Pei Zhang
Yu Li
Si Li
Boxin Shi
DiffM
67
7
0
24 May 2023
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim
Junsoo Lee
Kibeom Hong
Daesik Kim
Namhyuk Ahn
DiffM
88
15
0
24 May 2023
Deceptive-NeRF/3DGS: Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction
Xinhang Liu
Jiaben Chen
Shiu-hong Kao
Yu-Wing Tai
Chi-Keung Tang
DiffM
54
15
0
24 May 2023
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
Dongxu Li
Junnan Li
Steven C. H. Hoi
105
330
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
117
7
0
24 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
89
73
0
23 May 2023
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen
Yatai Ji
Jie Wu
Hefeng Wu
Pan Xie
Jiashi Li
Xin Xia
Xuefeng Xiao
Liang Lin
VGen
208
11
0
23 May 2023
VisorGPT: Learning Visual Prior via Generative Pre-Training
Jinheng Xie
Kai Ye
Yudong Li
Yuexiang Li
Kevin Qinghong Lin
Yefeng Zheng
Linlin Shen
Mike Zheng Shou
ViT
321
8
0
23 May 2023
Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation
Haochen Wang
Yujun Shen
Jingjing Fei
Wei Li
Liwei Wu
Yuxi Wang
Zhaoxiang Zhang
OOD
101
7
0
23 May 2023
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian
Boyi Li
Adam Yala
Trevor Darrell
106
164
0
23 May 2023
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
154
377
0
22 May 2023
MAGE: Machine-generated Text Detection in the Wild
Yafu Li
Qintong Li
Leyang Cui
Wei Bi
Zhilin Wang
Longyue Wang
Linyi Yang
Shuming Shi
Yue Zhang
DeLMO
129
58
0
22 May 2023
ControlVideo: Training-free Controllable Text-to-Video Generation
Yabo Zhang
Yuxiang Wei
Dongsheng Jiang
Xiaopeng Zhang
W. Zuo
Qi Tian
VGen
DiffM
124
254
0
22 May 2023
The CLIP Model is Secretly an Image-to-Prompt Converter
Yuxuan Ding
Chunna Tian
Haoxuan Ding
Lingqiao Liu
DiffM
59
15
0
22 May 2023
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Huadai Liu
Rongjie Huang
Xuan Lin
Wenqiang Xu
Maozong Zheng
Hong Chen
Jinzheng He
Zhou Zhao
DiffM
124
20
0
22 May 2023
DreamWaltz: Make a Scene with Complex 3D Animatable Avatars
Yukun Huang
Jianan Wang
Ailing Zeng
He Cao
Xianbiao Qi
Yukai Shi
Zhengjun Zha
Lei Zhang
101
73
0
21 May 2023
MaGIC: Multi-modality Guided Image Completion
Yongsheng Yu
Hao Wang
Tiejian Luo
Hengrui Fan
Libo Zhang
77
12
0
19 May 2023
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model
Chenjie Cao
Yunuo Cai
Qiaole Dong
Yikai Wang
Yanwei Fu
DiffM
96
15
0
19 May 2023
Brain Captioning: Decoding human brain activity into images and text
Matteo Ferrante
Furkan Ozcelik
T. Boccato
R. V. Rullen
N. Toschi
DiffM
92
30
0
19 May 2023
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis
Chang-Shu Liu
Rui Li
Kaidong Zhang
Xin Luo
Dong Liu
DiffM
90
3
0
19 May 2023
RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture
Liangchen Song
Liangliang Cao
Hongyu Xu
Kai Kang
Feng Tang
Junsong Yuan
Yang Zhao
VGen
DiffM
85
44
0
18 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
124
47
0
18 May 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Can Qin
Shu Zhen Zhang
Ning Yu
Yihao Feng
Xinyi Yang
...
Caiming Xiong
Silvio Savarese
Stefano Ermon
Yun Fu
Ran Xu
111
136
0
18 May 2023
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Yujie Lu
Xianjun Yang
Xiujun Li
Xinze Wang
William Yang Wang
EGVM
143
79
0
18 May 2023
Structural Pruning for Diffusion Models
Gongfan Fang
Xinyin Ma
Xinchao Wang
106
140
0
18 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Sitong Su
Jianlong Fu
Jiaying Liu
DiffM
VGen
155
117
0
18 May 2023
TextDiffuser: Diffusion Models as Text Painters
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
191
126
0
18 May 2023
LDM3D: Latent Diffusion Model for 3D
Gabriela Ben-Melech Stan
Diana Wofk
Scottie Fox
Alex Redden
Will Saxton
...
Estelle Aflalo
Shao-Yen Tseng
Fabio Nonato
Matthias Muller
Vasudev Lal
108
48
0
18 May 2023
DiffUTE: Universal Text Editing Diffusion Model
Haoxing Chen
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Xing Zheng
Yaohui Li
Changhua Meng
Huijia Zhu
Weiqiang Wang
DiffM
102
35
0
18 May 2023
OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields
Youtan Yin
Zhoujie Fu
Fan Yang
Guosheng Lin
114
30
0
17 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming-Yuan Liu
Yogesh Balaji
DiffM
VGen
125
263
0
17 May 2023
Controllable Mind Visual Diffusion Model
Bo-Wen Zeng
Shanglin Li
Xuhui Liu
Sicheng Gao
Xiaolong Jiang
Xu Tang
Feng-Long Xie
Jianzhuang Liu
Baochang Zhang
DiffM
62
26
0
17 May 2023
Face Recognition Using Synthetic Face Data
Omer Granoviter
Alexey Gruzdev
V. Loginov
Max Kogan
Orly Zvitia
89
1
0
17 May 2023
Generating coherent comic with rich story using ChatGPT and Stable Diffusion
Ze Jin
Zorina Song
DiffM
41
16
0
16 May 2023
Previous
1
2
3
...
58
59
60
61
62
Next