Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 367 papers shown
Title
StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining
Tushar Kataria
Beatrice Knudsen
Shireen Y. Elhabian
DiffM
MedIm
78
10
0
17 Mar 2024
Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Zhiqi Li
Yiming Chen
Lingzhe Zhao
Peidong Liu
DiffM
3DGS
101
18
0
15 Mar 2024
HeadEvolver: Text to Head Avatars via Expressive and Attribute-Preserving Mesh Deformation
D. B. Wang
Hengyu Meng
Zeyu Cai
Zhijing Shao
Qianxi Liu
Lin Wang
Mingming Fan
Xiaohang Zhan
Zhaoxiang Wang
99
3
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
132
6
0
14 Mar 2024
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Yuxuan Zhang
Lifu Wei
Qing Zhang
Yiren Song
DiffM
100
17
0
12 Mar 2024
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
Jiaxiang Cheng
Pan Xie
Xin Xia
Jiashi Li
Jie Wu
Yuxi Ren
Huixia Li
Xuefeng Xiao
Min Zheng
Lean Fu
97
12
0
04 Mar 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
240
21
0
28 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
151
98
0
27 Feb 2024
ADEPT: Hierarchical Bayes Approach to Personalized Federated Unsupervised Learning
Kaan Ozkara
Bruce Huang
Ruida Zhou
Suhas Diggavi
191
0
0
19 Feb 2024
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
Chenyu Wang
Weixin Luo
Qianyu Chen
Haonan Mai
Jindi Guo
Sixun Dong
Xiaohua Xuan
MLLM
LLMAG
107
18
0
19 Jan 2024
A Survey on 3D Gaussian Splatting
Guikun Chen
Wenguan Wang
3DGS
154
191
0
08 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
238
270
0
05 Jan 2024
Discrete Distribution Networks
Lei Yang
90
1
0
29 Dec 2023
RealCraft: Attention Control as A Tool for Zero-Shot Consistent Video Editing
Shutong Jin
Ruiyu Wang
Florian T. Pokorny
DiffM
VGen
128
1
0
19 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
95
4
0
11 Dec 2023
Stable Diffusion for Data Augmentation in COCO and Weed Datasets
Boyang Deng
44
1
0
07 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
151
15
0
04 Dec 2023
Meta ControlNet: Enhancing Task Adaptation via Meta Learning
Junjie Yang
Jinze Zhao
Peihao Wang
Zhangyang Wang
Yingbin Liang
93
3
0
03 Dec 2023
CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
Yichao Cai
Yuhang Liu
Zhen Zhang
Javen Qinfeng Shi
CLIP
VLM
103
8
0
28 Nov 2023
Flow-Guided Diffusion for Video Inpainting
Bohai Gu
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen
DiffM
81
12
0
26 Nov 2023
Image Super-Resolution with Text Prompt Diffusion
Zheng Chen
Yulun Zhang
Jinjin Gu
Xin Yuan
Linghe Kong
Guihai Chen
Xiaokang Yang
DiffM
113
20
0
24 Nov 2023
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Yan Li
Di Zhang
VLM
190
25
0
24 Nov 2023
Closed-Form Diffusion Models
Christopher Scarvelis
Haitz Sáez de Ocáriz Borde
Justin Solomon
DiffM
166
12
0
19 Oct 2023
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
Chuan Fang
Yuan Dong
Kunming Luo
Xiaotao Hu
Rakesh Shrestha
Ping Tan
DiffM
121
37
0
05 Oct 2023
Counting Guidance for High Fidelity Text-to-Image Synthesis
Wonjune Kang
Kevin Galim
H. Koo
Nam Ik Cho
DiffM
94
10
0
30 Jun 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
123
82
0
13 Apr 2023
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Wonwoong Cho
Hareesh Ravi
Midhun Harikumar
V. Khuc
Krishna Kumar Singh
Jingwan Lu
David I. Inouye
Ajinkya Kale
DiffM
117
7
0
28 Feb 2023
Region-Aware Diffusion for Zero-shot Text-driven Image Editing
Nisha Huang
Fan Tang
Weiming Dong
Tong-Yee Lee
Changsheng Xu
DiffM
64
25
0
23 Feb 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions
Lianghua Huang
Di Chen
Yu Liu
Yujun Shen
Deli Zhao
Jingren Zhou
DiffM
56
288
0
20 Feb 2023
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou
Xintao Wang
Liangbin Xie
Yanze Wu
Shuai Liu
Zhongang Qi
Ying Shan
Xiaohu Qie
DiffM
121
1,027
0
16 Feb 2023
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
Omer Bar-Tal
Lior Yariv
Y. Lipman
Tali Dekel
78
383
1
16 Feb 2023
MaskSketch: Unpaired Structure-guided Masked Image Generation
D. Bashkirova
José Lezama
Kihyuk Sohn
Kate Saenko
Irfan Essa
DiffM
42
25
0
10 Feb 2023
Zero-shot Image-to-Image Translation
Gaurav Parmar
Krishna Kumar Singh
Richard Y. Zhang
Yijun Li
Jingwan Lu
Jun-Yan Zhu
DiffM
80
447
0
06 Feb 2023
GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
VLM
112
599
1
17 Jan 2023
OrthoGAN:High-Precision Image Generation for Teeth Orthodontic Visualization
Feihong Shen
Jingjing Liu
Hai-Zhen Li
B. Fang
Chenglong Ma
Jinxiang Hao
Yang Feng
Youyi Zheng
Youyi Zheng
MedIm
DiffM
199
1
0
29 Dec 2022
CLIPascene: Scene Sketching with Different Types and Levels of Abstraction
Yael Vinker
Yuval Alaluf
Daniel Cohen-Or
Ariel Shamir
CLIP
72
58
0
30 Nov 2022
SpaText: Spatio-Textual Representation for Controllable Image Generation
Omri Avrahami
Thomas Hayes
Oran Gafni
Sonal Gupta
Yaniv Taigman
Devi Parikh
Dani Lischinski
Ohad Fried
Xiaoyue Yin
DiffM
83
208
0
25 Nov 2022
Sketch-Guided Text-to-Image Diffusion Models
A. Voynov
Kfir Aberman
Daniel Cohen-Or
DiffM
85
209
0
24 Nov 2022
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation
Narek Tumanyan
Michal Geyer
Shai Bagon
Tali Dekel
126
679
0
22 Nov 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
205
1,813
0
17 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
64
343
0
10 Nov 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
76
1,089
0
17 Oct 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
192
3,482
0
16 Oct 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
279
2,861
0
25 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
200
1,773
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
160
1,889
0
02 Aug 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
193
3,898
0
26 Jul 2022
Semantic Image Synthesis via Diffusion Models
Weilun Wang
Weilun Wang
Wen-gang Zhou
Dongdong Chen
Dong Chen
Lu Yuan
Houqiang Li
DiffM
328
178
0
30 Jun 2022
Pretraining is All You Need for Image-to-Image Translation
Tengfei Wang
Ting Zhang
Bo Zhang
Hao Ouyang
Dong Chen
Qifeng Chen
Fang Wen
DiffM
243
178
0
25 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
437
6,023
0
23 May 2022
Previous
1
2
3
4
5
6
7
8
Next