Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
OSTAF: A One-Shot Tuning Method for Improved Attribute-Focused T2I Personalization
Ye Wang
Zili Yi
Rui Ma
DiffM
74
0
0
17 Mar 2024
StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining
Tushar Kataria
Beatrice Knudsen
Shireen Y. Elhabian
DiffM
MedIm
105
10
0
17 Mar 2024
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Yeongtak Oh
Jonghyun Lee
Jooyoung Choi
Dahuin Jung
Uiwon Hwang
Sungroh Yoon
TTA
DiffM
82
5
0
16 Mar 2024
ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models
Yuwen Chen
Nicholas Konz
Han Gu
Haoyu Dong
Yaqian Chen
Lin Li
Jisoo Lee
Maciej A. Mazurowski
MedIm
72
1
0
16 Mar 2024
StableGarment: Garment-Centric Generation via Stable Diffusion
Rui Wang
Hailong Guo
Jiaming Liu
Huaxia Li
Haibo Zhao
Xu Tang
Yao Hu
Hao Tang
Peipei Li
DiffM
66
16
0
16 Mar 2024
Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation
Anton Pelykh
Ozge Mercanoglu
Richard Bowden
DiffM
69
8
0
15 Mar 2024
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
Yizhi Song
Zhifei Zhang
Zhe Lin
Scott D. Cohen
Brian L. Price
Jianming Zhang
Soo Ye Kim
He Zhang
Wei Xiong
Daniel G. Aliaga
DiffM
102
41
0
15 Mar 2024
LightIt: Illumination Modeling and Control for Diffusion Models
Peter Kocsis
Julien Philip
Kalyan Sunkavalli
Matthias Nießner
Yannick Hold-Geoffroy
76
24
0
15 Mar 2024
Strong and Controllable Blind Image Decomposition
Zeyu Zhang
Junlin Han
Chenhui Gou
Hongdong Li
Liang Zheng
82
2
0
15 Mar 2024
Animate Your Motion: Turning Still Images into Dynamic Videos
Mingxiao Li
Bo Wan
Marie-Francine Moens
Tinne Tuytelaars
VGen
DiffM
91
7
0
15 Mar 2024
SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model
Tao Wu
Xuewei Li
Zhongang Qi
Di Hu
Xintao Wang
Ying Shan
Xi Li
86
7
0
15 Mar 2024
Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Zhiqi Li
Yiming Chen
Lingzhe Zhao
Peidong Liu
DiffM
3DGS
153
18
0
15 Mar 2024
SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis
Huan-ang Gao
Mingju Gao
Jiaju Li
Wenyi Li
Rong Zhi
Hao Tang
Hao Zhao
DiffM
113
6
0
14 Mar 2024
Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image
Yiqun Mei
Yu Zeng
He Zhang
Zhixin Shu
Xuaner Zhang
Sai Bi
Jianming Zhang
HyunJoon Jung
Vishal M. Patel
100
15
0
14 Mar 2024
Generalized Predictive Model for Autonomous Driving
Jiazhi Yang
Shenyuan Gao
Yihang Qiu
Li Chen
Tianyu Li
...
Ping Luo
Jun Zhang
Andreas Geiger
Yu Qiao
Hongyang Li
VGen
135
76
0
14 Mar 2024
What Sketch Explainability Really Means for Downstream Tasks
Hmrishav Bandyopadhyay
Pinaki Nath Chowdhury
A. Bhunia
Aneeshan Sain
Tao Xiang
Yi-Zhe Song
101
4
0
14 Mar 2024
Towards Faster Training of Diffusion Models: An Inspiration of A Consistency Phenomenon
Tianshuo Xu
Peng Mi
Ruilin Wang
Yingcong Chen
DiffM
162
6
0
14 Mar 2024
SketchINR: A First Look into Sketches as Implicit Neural Representations
Hmrishav Bandyopadhyay
A. Bhunia
Pinaki Nath Chowdhury
Aneeshan Sain
Tao Xiang
Timothy M. Hospedales
Yi-Zhe Song
SSL
88
10
0
14 Mar 2024
Video Editing via Factorized Diffusion Distillation
Uriel Singer
Amit Zohar
Yuval Kirstain
Shelly Sheynin
Adam Polyak
Devi Parikh
Yaniv Taigman
DiffM
VGen
91
15
0
14 Mar 2024
StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images
R. Jewsbury
Ruoyu Wang
A. Bhalerao
Nasir M. Rajpoot
Q. Vu
102
2
0
14 Mar 2024
XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model
Anees Ur Rehman Hashmi
Ibrahim Almakky
Mohammad Areeb Qazi
Santosh Sanjeev
Vijay Ram Papineni
Dwarikanath Mahapatra
Mohammad Yaqub
MedIm
95
6
0
14 Mar 2024
Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior
Cheng Chen
Xiaofeng Yang
Fan Yang
Chengzeng Feng
Zhoujie Fu
Chuan-Sheng Foo
Guosheng Lin
Fayao Liu
111
14
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
158
7
0
14 Mar 2024
HeadEvolver: Text to Head Avatars via Expressive and Attribute-Preserving Mesh Deformation
D. B. Wang
Hengyu Meng
Zeyu Cai
Zhijing Shao
Qianxi Liu
Lin Wang
Mingming Fan
Xiaohang Zhan
Zhaoxiang Wang
139
3
0
14 Mar 2024
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Enric Corona
Andrei Zanfir
Eduard Gabriel Bazavan
Nikos Kolotouros
Thiemo Alldieck
C. Sminchisescu
VGen
DiffM
107
32
0
13 Mar 2024
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Jing Wu
Jiawang Bian
Xinghui Li
Guangrun Wang
Ian D Reid
Philip Torr
V. Prisacariu
3DGS
98
42
0
13 Mar 2024
Data Augmentation in Human-Centric Vision
Wentao Jiang
Yige Zhang
Shaozhong Zheng
Si Liu
Shuicheng Yan
91
1
0
13 Mar 2024
ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos
Lei Shi
Paul-Christian Bürkner
Andreas Bulling
DiffM
VGen
83
4
0
13 Mar 2024
Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
Pengze Zhang
Hubery Yin
Chen Li
Xiaohua Xie
101
9
0
13 Mar 2024
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts
Yue Ma
Yin-Yin He
Hongfa Wang
Andong Wang
Chenyang Qi
...
Xiu Li
Zhifeng Li
H. Shum
Wei Liu
Qifeng Chen
VGen
DiffM
161
43
0
13 Mar 2024
Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models
Jian Lin
Xueting Liu
Chengze Li
M. Xie
Tien-Tsin Wong
DiffM
32
2
0
13 Mar 2024
Mitigating the Impact of Attribute Editing on Face Recognition
Sudipta Banerjee
Sai Pranaswi Mullangi
Shruti Wagle
Chinmay Hegde
Nasir Memon
CVBM
111
1
0
12 Mar 2024
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
Shihao Zhao
Shaozhe Hao
Bojia Zi
Huaizhe Xu
Kwan-Yee K. Wong
DiffM
VLM
108
9
0
12 Mar 2024
SemCity: Semantic Scene Generation with Triplane Diffusion
Jumin Lee
Sebin Lee
Changho Jo
Woobin Im
Juhyeong Seon
Sung-eui Yoon
DiffM
95
21
0
12 Mar 2024
Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal
Yijun Yang
Hongtao Wu
Angelica I. Aviles-Rivero
Yulun Zhang
Jing Qin
Lei Zhu
106
9
0
12 Mar 2024
DragAnything: Motion Control for Anything using Entity Representation
Wejia Wu
Zhuang Li
Yuchao Gu
Rui Zhao
Yefei He
David Junhao Zhang
Mike Zheng Shou
Yan Li
Yan Li
Di Zhang
VGen
145
62
0
12 Mar 2024
Efficient Diffusion Model for Image Restoration by Residual Shifting
Zongsheng Yue
Jianyi Wang
Chen Change Loy
DiffM
121
38
0
12 Mar 2024
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang
Zehua Du
Yuyuan Zhao
Bo Yuan
Kexiang Wang
...
Yihen Lu
Gengliang Li
Junlong Gao
Xin Tu
Zhenyu Guo
LLMAG
VGen
81
8
0
12 Mar 2024
It's All About Your Sketch: Democratising Sketch Control in Diffusion Models
Subhadeep Koley
A. Bhunia
Deeptanshu Sekhri
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
DiffM
90
16
0
12 Mar 2024
You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval
Subhadeep Koley
A. Bhunia
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
3DV
121
11
0
12 Mar 2024
Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers
Subhadeep Koley
A. Bhunia
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
DiffM
95
7
0
12 Mar 2024
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Yuxuan Zhang
Lifu Wei
Qing Zhang
Yiren Song
DiffM
113
17
0
12 Mar 2024
Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions
Lan Wang
Vishnu Boddeti
Sernam Lim
VGen
DiffM
56
0
0
11 Mar 2024
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
Xu Ju
Xian Liu
Xintao Wang
Hao Wang
Ying Shan
Qiang Xu
93
78
0
11 Mar 2024
Bayesian Diffusion Models for 3D Shape Reconstruction
Haiyang Xu
Yu Lei
Zeyuan Chen
Xiang Zhang
Yue Zhao
Yilin Wang
Zhuowen Tu
DiffM
94
9
0
11 Mar 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Jialu Li
Jaemin Cho
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
MoMe
DiffM
103
9
0
11 Mar 2024
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations
Tianhao Qi
Shancheng Fang
Yanze Wu
Hongtao Xie
Jiawei Liu
Lang Chen
Qian He
Yongdong Zhang
DiffM
79
43
0
11 Mar 2024
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
Xiaofeng Wang
Zheng Zhu
Xinze Chen
Guan Huang
Xiaoyi Bao
Xingang Wang
VGen
62
80
0
11 Mar 2024
Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation
Guangyang Wu
Xiaohong Liu
Jun Jia
Xuehao Cui
Guangtao Zhai
64
4
0
11 Mar 2024
DivCon: Divide and Conquer for Progressive Text-to-Image Generation
Yuhao Jia
Wenhan Tan
DiffM
104
1
0
11 Mar 2024
Previous
1
2
3
...
39
40
41
...
60
61
62
Next