2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Wei Wei
Tingbo Hou
Yael Pritch
Neal Wadhwa
Michael Rubinstein
Kfir Aberman
DiffM
101
183
0
13 Jul 2023
Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data
Miroslav Purkrábek
Jiří Matas
73
2
0
13 Jul 2023
My3DGen: A Scalable Personalized 3D Generative Model
Luchao Qi
Jiaye Wu
Annie N. Wang
Sheng-Yu Wang
Roni Sengupta
3DH
91
5
0
11 Jul 2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Feng Li
Hao Zhang
Pei Sun
Xueyan Zou
Siyi Liu
Jianwei Yang
Chun-yue Li
Lei Zhang
Jianfeng Gao
VLM
112
177
0
10 Jul 2023
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Jaskirat Singh
Liang Zheng
107
19
0
10 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
152
882
0
10 Jul 2023
FreeDrag: Feature Dragging for Reliable Point-based Image Editing
Pengyang Ling
Lin Chen
Pan Zhang
H. Chen
Yi Jin
Jinjin Zheng
DiffM
108
16
0
10 Jul 2023
DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer
Dan Ruta
Gemma Canet Tarrés
Andrew Gilbert
Eli Shechtman
Nicholas I. Kolkin
John Collomosse
DiffM
96
5
0
09 Jul 2023
Blocks2World: Controlling Realistic Scenes with Editable Primitives
Vaibhav Vavilala
Seemandhar Jain
R. Vasanth
Anand Bhattad
David A. Forsyth
VGen
89
4
0
07 Jul 2023
Text-Guided Synthesis of Eulerian Cinemagraphs
Aniruddha Mahapatra
Aliaksandr Siarohin
Hsin-Ying Lee
Sergey Tulyakov
Sitong Su
DiffM
VGen
92
21
0
06 Jul 2023
A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task
Shiqi Yang
Atsushi Hashimoto
Yoshitaka Ushiku
DiffM
VLM
76
1
0
06 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Zhang
DiffM
125
154
0
05 Jul 2023
RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation
Renato Sortino
T. Cecconello
A. DeMarco
G. Fiameni
Andrea Pilzer
...
E. Sciacca
A. Ingallinera
C. Bordiu
F. Bufano
C. Spampinato
DiffM
57
2
0
05 Jul 2023
Towards Open Federated Learning Platforms: Survey and Vision from Technical and Legal Perspectives
Moming Duan
Qinbin Li
Linshan Jiang
Bingsheng He
FedML
105
5
0
05 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Müller
Joe Penna
Robin Rombach
302
2,457
0
04 Jul 2023
Text + Sketch: Image Compression at Ultra Low Rates
Eric Lei
Yiğit Berkay Uslu
Hamed Hassani
Shirin Saeedi Bidokhti
DiffM
75
50
0
04 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
62
21
0
04 Jul 2023
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion
Shitao Tang
Fuyang Zhang
Jiacheng Chen
Peng Wang
Yasutaka Furukawa
131
156
0
03 Jul 2023
DifFSS: Diffusion Model for Few-Shot Semantic Segmentation
Weimin Tan
Siyuan Chen
Bo Yan
DiffM
85
25
0
03 Jul 2023
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation
Zhuowei Chen
Shancheng Fang
Wei Liu
Qian He
Mengqi Huang
Yongdong Zhang
Zhendong Mao
DiffM
125
24
0
01 Jul 2023
AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence
Jiarui Wang
Huiyu Duan
Jing Liu
S. Chen
Xiongkuo Min
Guangtao Zhai
EGVM
96
60
0
01 Jul 2023
DisCo: Disentangled Control for Realistic Human Dance Generation
Tan Wang
Linjie Li
Kevin Qinghong Lin
Yuanhao Zhai
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
VGen
141
89
0
30 Jun 2023
Counting Guidance for High Fidelity Text-to-Image Synthesis
Wonjune Kang
Kevin Galim
H. Koo
Nam Ik Cho
DiffM
122
10
0
30 Jun 2023
Generate Anything Anywhere in Any Scene
Yuheng Li
Haotian Liu
Yangming Wen
Yong Jae Lee
DiffM
134
12
0
29 Jun 2023
Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models
Zeqi Gu
Abe Davis
DiffM
54
2
0
29 Jun 2023
ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models
Weihao Cheng
Yan-Pei Cao
Ying Shan
DiffM
113
6
0
29 Jun 2023
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Zibo Zhao
Wen Liu
Xin Chen
Xi Zeng
Rui Wang
Pei Cheng
Bin-Bin Fu
Tao Chen
Gang Yu
Shenghua Gao
DiffM
155
107
0
29 Jun 2023
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization
Minghua Liu
Chao Xu
Haian Jin
Ling-Hao Chen
T. MukundVarma
Zexiang Xu
Hao Su
140
469
0
29 Jun 2023
DiffComplete: Diffusion-based Generative 3D Shape Completion
Ruihang Chu
Enze Xie
Shentong Mo
Zhenguo Li
Matthias Nießner
Chi-Wing Fu
Jiaya Jia
DiffM
73
23
0
28 Jun 2023
Next Steps for Human-Centered Generative AI: A Technical Perspective
Xiang 'Anthony' Chen
Jeff Burke
Andrea Colaço
Matthew K. Hong
Jennifer Jacobs
...
Dingzeyu Li
Nanyun Peng
Karl D. D. Willis
Chien-Sheng Wu
Bolei Zhou
LLMAG
91
35
0
27 Jun 2023
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLM
DiffM
39
51
0
26 Jun 2023
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang
Guibao Shen
Wenhang Ge
Guangyong Chen
Yijun Li
Yingke Chen
DiffM
78
4
0
26 Jun 2023
Zero-shot spatial layout conditioning for text-to-image diffusion models
Guillaume Couairon
Marlene Careil
Matthieu Cord
Stéphane Lathuilière
Jakob Verbeek
VLM
77
65
0
23 Jun 2023
DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology
Marco Aversa
Gabriel Nobis
Miriam Hägele
Kai Standvoss
Mihaela Chirica
...
D. Ivanova
Wojciech Samek
Frederick Klauschen
B. Sanguinetti
Luis Oala
MedIm
100
23
0
23 Jun 2023
Continuous Layout Editing of Single Images with Diffusion Models
Zhiyuan Zhang
Zhitong Huang
J. Liao
DiffM
63
10
0
22 Jun 2023
Eliminating Lipschitz Singularities in Diffusion Models
Zhantao Yang
Ruili Feng
Han Zhang
Yujun Shen
Kaixuan Zhu
...
Yifei Zhang
Yu Liu
Deli Zhao
Jingren Zhou
Fan Cheng
57
12
0
20 Jun 2023
Image Harmonization with Diffusion Model
Jia-jin Li
Jian Wang
Chen Wang
Jinjun Xiong
DiffM
62
3
0
17 Jun 2023
AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation
Yifei Zeng
Yuanxun Lu
Xinya Ji
Yao Yao
Hao Zhu
Xun Cao
DiffM
74
30
0
16 Jun 2023
Evaluating the Robustness of Text-to-image Diffusion Models against Real-world Attacks
Hongcheng Gao
Hao Zhang
Yinpeng Dong
Zhijie Deng
AAML
109
23
0
16 Jun 2023
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin
Eran Hirsch
Daniel Glickman
Shauli Ravfogel
Yoav Goldberg
Gal Chechik
DiffM
115
108
0
15 Jun 2023
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing
Paul Couairon
Clément Rambour
Jean-Emmanuel Haugeard
Nicolas Thome
DiffM
VGen
76
30
0
14 Jun 2023
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
Ruoyu Wang
Yongqi Yang
Zhihao Qian
Ye Zhu
Yuehua Wu
DiffM
97
14
0
14 Jun 2023
ZeroForge: Feedforward Text-to-Shape Without 3D Supervision
Kelly O. Marshall
Minh Pham
Ameya Joshi
Anushrut Jignasu
Aditya Balu
Adarsh Krishnamurthy
A. Hegde
CLIP
60
3
0
14 Jun 2023
Generating Images with 3D Annotations Using Diffusion Models
Wufei Ma
Qihao Liu
Jiahao Wang
Angtian Wang
Xiaoding Yuan
...
Ruxiao Duan
Yongrui Qi
Adam Kortylewski
Yaoyao Liu
Alan Yuille
DiffM
86
5
0
13 Jun 2023
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGen
DiffM
102
221
0
13 Jun 2023
VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON
Haoping Bai
Shancong Mou
Tatiana Likhomanenko
R. G. Cinbis
Oncel Tuzel
Ping Huang
Jiulong Shan
Jianjun Shi
Mengsi Cao
VLM
73
26
0
13 Jun 2023
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model
Xinyu Zhang
Jiaxian Guo
Paul D. Yoo
Yutaka Matsuo
Yusuke Iwasawa
DiffM
108
22
0
13 Jun 2023
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Zeju Qiu
Wei-yu Liu
Haiwen Feng
Yuxuan Xue
Yao Feng
Zhen Liu
Dan Zhang
Adrian Weller
Bernhard Schölkopf
DiffM
126
158
0
12 Jun 2023
Scalable 3D Captioning with Pretrained Models
Tiange Luo
C. Rockwell
Honglak Lee
Justin Johnson
116
160
0
12 Jun 2023
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
Sitong Su
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
VGen
DiffM
90
40
0
12 Jun 2023