Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
102
34
0
15 May 2023
Meta-DM: Applications of Diffusion Models on Few-Shot Learning
W. Hu
Xiurong Jiang
Jiarun Liu
Yuqi Yang
Hui Tian
DiffM
80
7
0
14 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
179
326
0
11 May 2023
Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator
Jing Zhao
Heliang Zheng
Chaoyue Wang
Long Lan
Wanrong Huang
Wenjing Yang
DiffM
102
10
0
11 May 2023
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Chenghao Li
Chaoning Zhang
Atish Waghwase
Lik-Hang Lee
François Rameau
Yang Yang
Sung-Ho Bae
Choong Seon Hong
104
78
0
10 May 2023
MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis
Jinsheng Zheng
Daqing Liu
Chaoyue Wang
Minghui Hu
Zuopeng Yang
Changxing Ding
Dacheng Tao
72
1
0
10 May 2023
Text-guided High-definition Consistency Texture Model
Zhibin Tang
Tiantong He
DiffM
37
6
0
10 May 2023
Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models
Rohan Dhesikan
V. Rajmohan
DiffM
VGen
48
7
0
10 May 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu
Yinan He
Wenhai Wang
Weiyun Wang
Yi Wang
...
Yali Wang
Limin Wang
Ping Luo
Jifeng Dai
Yu Qiao
LRM
MLLM
150
85
0
09 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Liang Lin
94
41
0
09 May 2023
Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models
Wenkai Dong
Song Xue
Xiaoyue Duan
Shumin Han
DiffM
95
62
0
08 May 2023
Exploring One-shot Semi-supervised Federated Learning with A Pre-trained Diffusion Model
Min Yang
Shangchao Su
Bin Li
Xiangyang Xue
DiffM
127
31
0
06 May 2023
AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion
Seungwoo Lee
Chaerin Kong
D. Jeon
Nojun Kwak
DiffM
111
20
0
06 May 2023
LEO: Generative Latent Image Animator for Human Video Synthesis
Yaohui Wang
Xin Ma
Xinyuan Chen
A. Dantcheva
Bo Dai
Yu Qiao
DiffM
183
33
0
06 May 2023
Guided Image Synthesis via Initial Image Editing in Diffusion Model
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
89
55
0
05 May 2023
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
Hong Chen
Yipeng Zhang
Simin Wu
Xin Eric Wang
Xuguang Duan
Yuwei Zhou
Wenwu Zhu
DiffM
110
51
0
05 May 2023
TUVF: Learning Generalizable Texture UV Radiance Fields
Annie Cheng
Xueting Li
Sifei Liu
Xinyu Wang
DiffM
82
8
0
04 May 2023
In-Context Learning Unlocked for Diffusion Models
Zhendong Wang
Yi Ding
Yadong Lu
Yelong Shen
Pengcheng He
Weizhu Chen
Zhangyang Wang
Mingyuan Zhou
VLM
DiffM
150
78
0
01 May 2023
Let the Chart Spark: Embedding Semantic Context into Chart with Text-to-Image Generative Model
Shishi Xiao
Suizi Huang
Yue Lin
Yilin Ye
Weizhen Zeng
83
34
0
28 Apr 2023
Generating images of rare concepts using pre-trained diffusion models
Dvir Samuel
Rami Ben-Ari
Simon Raviv
N. Darshan
Gal Chechik
241
44
0
27 Apr 2023
Edit Everything: A Text-Guided Generative System for Images Editing
Defeng Xie
Ruichen Wang
Jiancang Ma
Chen Chen
H. Lu
Ke Wang
Fobo Shi
Xiaodong Lin
DiffM
166
32
0
27 Apr 2023
TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation
Zhaoyan Liu
Noël Vouitsis
S. Gorti
Jimmy Ba
Gabriel Loaiza-Ganem
ViT
73
1
0
26 Apr 2023
Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images
Zeyu Lu
Di Huang
Lei Bai
Jingjing Qu
Chengzhi Wu
Xihui Liu
Wanli Ouyang
90
58
0
25 Apr 2023
The Potential of Visual ChatGPT For Remote Sensing
L. Osco
Eduardo Lopes de Lemos
W. Gonçalves
A. P. Ramos
J. M. Junior
77
31
0
25 Apr 2023
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models
Zhendong Wang
Yi Ding
Huangjie Zheng
Peihao Wang
Pengcheng He
Zhangyang Wang
Weizhu Chen
Mingyuan Zhou
90
108
0
25 Apr 2023
Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Yu-Hui Chen
Raman Sarokin
Juhyun Lee
Jiuqiang Tang
Chuo-Ling Chang
Andrei Kulik
Matthias Grundmann
VLM
77
43
0
21 Apr 2023
Advances in Deep Concealed Scene Understanding
Deng-Ping Fan
Ge-Peng Ji
Peng Xu
Ming-Ming Cheng
Daniel Gehrig
Luc Van Gool
103
74
0
21 Apr 2023
Improved Diffusion-based Image Colorization via Piggybacked Models
Hanyuan Liu
Jinbo Xing
M. Xie
Chengze Li
T. Wong
VLM
DiffM
60
19
0
21 Apr 2023
Anything-3D: Towards Single-view Anything Reconstruction in the Wild
Qiuhong Shen
Xingyi Yang
Xinchao Wang
DiffM
77
88
0
19 Apr 2023
Promptify: Text-to-Image Generation through Interactive Prompt Exploration with Large Language Models
Stephen Brade
Bryan Wang
Maurício Sousa
Sageev Oore
Tovi Grossman
MLLM
DiffM
77
94
0
18 Apr 2023
Text-guided Image-and-Shape Editing and Generation: A Short Survey
Cheng-Kang Ted Chao
Y. Gingold
129
3
0
18 Apr 2023
UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer
Soon Yau Cheong
A. Mustafa
Andrew Gilbert
DiffM
103
13
0
18 Apr 2023
Generative Disco: Text-to-Video Generation for Music Visualization
Vivian Liu
Tao Long
Nathan Raw
Lydia B. Chilton
VGen
66
34
0
17 Apr 2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Ming Cao
Xintao Wang
Zhongang Qi
Ying Shan
Xiaohu Qie
Yinqiang Zheng
DiffM
109
471
0
17 Apr 2023
Everyone Can Be Picasso? A Computational Framework into the Myth of Human versus AI Painting
Yilin Ye
Rong Huang
Kangyi Zhang
Weizhen Zeng
27
1
0
17 Apr 2023
Magnitude Invariant Parametrizations Improve Hypernetwork Learning
Jose Javier Gonzalez Ortiz
John Guttag
Adrian Dalca
74
7
0
15 Apr 2023
On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence
Gengchen Mai
Weiming Huang
Jin Sun
Suhang Song
Deepak Mishra
...
Yingjie Hu
Chris Cundy
Ziyuan Li
Rui Zhu
Ni Lao
AI4CE
122
134
0
13 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
159
82
0
13 Apr 2023
Segment Everything Everywhere All at Once
Xueyan Zou
Jianwei Yang
Hao Zhang
Feng Li
Linjie Li
Jianfeng Wang
Lijuan Wang
Jianfeng Gao
Yong Jae Lee
MLLM
VLM
113
491
0
13 Apr 2023
Control3Diff: Learning Controllable 3D Diffusion Models from Single-view Images
Jiatao Gu
Qingzhe Gao
Shuangfei Zhai
Baoquan Chen
Lingjie Liu
J. Susskind
105
29
0
13 Apr 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Joey Tianyi Zhou
EGVM
72
6
0
13 Apr 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning
Enze Xie
Lewei Yao
Han Shi
Zhili Liu
Daquan Zhou
Zhaoqiang Liu
Jiawei Li
Zhenguo Li
74
81
0
13 Apr 2023
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
J. Karras
Aleksander Holynski
Ting-Chun Wang
Ira Kemelmacher-Shlizerman
DiffM
VGen
87
148
0
12 Apr 2023
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji
Guanhua Zhang
Zhaowen Wang
Bairu Hou
Zhifei Zhang
Brian L. Price
Shiyu Chang
DiffM
77
31
0
12 Apr 2023
DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion
Zihan Cao
Shiqi Cao
Xiao Wu
Junming Hou
Ran Ran
Liang-Jian Deng
DiffM
75
15
0
10 Apr 2023
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
Xu Ju
Ailing Zeng
Chenchen Zhao
Jianan Wang
Lei Zhang
Qian Xu
DiffM
80
93
0
09 Apr 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe Lin
Yang Zhang
Shiyu Chang
DiffM
92
46
0
07 Apr 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe Lin
H. J. Jung
DiffM
184
294
0
06 Apr 2023
RoSteALS: Robust Steganography using Autoencoder Latent Space
Tu Bui
Shrutina Agarwal
Ning Yu
John Collomosse
100
42
0
06 Apr 2023
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
225
237
0
06 Apr 2023
Previous
1
2
3
...
59
60
61
62
Next