Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhen Zhou
Fan Ma
Hehe Fan
Yi Yang
3DGS
90
24
0
09 Feb 2024
Collaborative Control for Geometry-Conditioned PBR Image Generation
Shimon Vainer
Mark Boss
Mathias Parger
Konstantin Kutsy
Dante De Nigris
Ciara Rowles
Nicolas Perony
Simon Donné
DiffM
65
13
0
08 Feb 2024
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
W. Para
Abdelrahman Eldesokey
Zhenyu Li
Pradyumna Reddy
Jiankang Deng
Peter Wonka
DiffM
74
0
0
08 Feb 2024
Scalable Diffusion Models with State Space Backbone
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Junshi Huang
142
40
0
08 Feb 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
DiffM
96
61
0
08 Feb 2024
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
89
30
0
08 Feb 2024
SPAD : Spatially Aware Multiview Diffusers
Yash Kant
Ziyi Wu
Michael Vasilkovsky
Guocheng Qian
Jian Ren
R. A. Guler
Guohao Li
Sergey Tulyakov
Igor Gilitschenski
Aliaksandr Siarohin
DiffM
108
38
0
07 Feb 2024
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models
Nicholas Konz
Yuwen Chen
Haoyu Dong
Maciej A. Mazurowski
MedIm
124
31
0
07 Feb 2024
λ
λ
λ
-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Maitreya Patel
Sangmin Jung
Chitta Baral
Yezhou Yang
VLM
95
35
0
07 Feb 2024
ChatScratch: An AI-Augmented System Toward Autonomous Visual Programming Learning for Children Aged 6-12
Liuqing Chen
Shuhong Xiao
Yunnong Chen
Ruoyu Wu
Yaxuan Song
Lingyun Sun
AI4Ed
LRM
51
22
0
07 Feb 2024
Text2Street: Controllable Text-to-image Generation for Street Views
Jinming Su
Songen Gu
Yiting Duan
Xing‐zhen Chen
Junfeng Luo
DiffM
90
6
0
07 Feb 2024
GenLens: A Systematic Evaluation of Visual GenAI Model Outputs
Tica Lin
Hanspeter Pfister
Jui-Hsien Wang
ELM
45
1
0
06 Feb 2024
InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang
Trevor Darrell
Sai Saketh Rambhatla
Rohit Girdhar
Ishan Misra
VLM
DiffM
61
101
0
05 Feb 2024
Guidance with Spherical Gaussian Constraint for Conditional Diffusion
Lingxiao Yang
Shutong Ding
Yifan Cai
Jingyi Yu
Jingya Wang
Ye-ling Shi
DiffM
138
40
0
05 Feb 2024
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Shiyuan Yang
Liang Hou
Haibin Huang
Chongyang Ma
Pengfei Wan
Di Zhang
Xiaodong Chen
Jing Liao
VGen
DiffM
155
86
0
05 Feb 2024
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Yan Shu
Weichao Zeng
Zhenhang Li
Fangmin Zhao
Yu Zhou
86
3
0
05 Feb 2024
Extreme Two-View Geometry From Object Poses with Diffusion Models
Yujing Sun
Caiyi Sun
Yuan Liu
Yuexin Ma
Siu-Ming Yiu
122
2
0
05 Feb 2024
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
Yang Sui
Huy Phan
Jinqi Xiao
Tian-Di Zhang
Zijie Tang
Cong Shi
Yan Wang
Yingying Chen
Bo Yuan
DiffM
AAML
72
13
0
05 Feb 2024
PixelGen: Rethinking Embedded Camera Systems
Kunjun Li
Manoj Gulati
Steven Waskito
Dhairya Shah
Shantanu Chakrabarty
Ambuj Varshney
3DV
21
1
0
04 Feb 2024
M
3
^3
3
Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing
Mohammadreza Mofayezi
Reza Alipour
Mohammad Ali Kakavand
Ehsaneddin Asgari
CVBM
58
1
0
04 Feb 2024
Decomposition-based and Interference Perception for Infrared and Visible Image Fusion in Complex Scenes
Xilai Li
Xiaosong Li
Haishu Tan
64
1
0
03 Feb 2024
Cross-view Masked Diffusion Transformers for Person Image Synthesis
T. Pham
Zhang Kang
Chang D. Yoo
108
6
0
02 Feb 2024
PRIME: Protect Your Videos From Malicious Editing
Guanlin Li
Shuai Yang
Jie Zhang
Tianwei Zhang
75
1
0
02 Feb 2024
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning
Fu-Yun Wang
Zhaoyang Huang
Xiaoyu Shi
Weikang Bian
Guanglu Song
Yu Liu
Hongsheng Li
62
16
0
01 Feb 2024
CapHuman: Capture Your Moments in Parallel Universes
Chao Liang
Fan Ma
Linchao Zhu
Yingying Deng
Yi Yang
DiffM
80
23
0
01 Feb 2024
Machine Unlearning for Image-to-Image Generative Models
Guihong Li
Hsiang Hsu
Chun-Fu Chen
R. Marculescu
MU
VLM
148
30
0
01 Feb 2024
Recasting Regional Lighting for Shadow Removal
Yuhao Liu
Zhanghan Ke
Ke Xu
Fang Liu
Zhenwei Wang
Rynson W. H. Lau
84
17
0
01 Feb 2024
Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators
Daniel Geng
Andrew Owens
DiffM
83
34
0
31 Jan 2024
Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation
Yuanhuiyi Lyu
Xueye Zheng
Lin Wang
DiffM
104
11
0
31 Jan 2024
You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Mehdi Noroozi
Isma Hadji
Brais Martínez
Adrian Bulat
Georgios Tzimiropoulos
90
13
0
30 Jan 2024
Repositioning the Subject within Image
Yikai Wang
Chenjie Cao
Ke Fan
Qiaole Dong
Yifan Li
Xiangyang Xue
Yanwei Fu
DiffM
99
2
0
30 Jan 2024
BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion
Yonghao Yu
Shunan Zhu
Huai Qin
Haorui Li
Jinglu Hu
69
8
0
30 Jan 2024
Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization
Henglei Lv
Jiayu Xiao
Liang Li
Qingming Huang
DiffM
100
6
0
30 Jan 2024
A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect
Yunkang Cao
Xiaohao Xu
Jiangning Zhang
Yuqi Cheng
Xiaonan Huang
Guansong Pang
Nong Sang
127
46
0
29 Jan 2024
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
Kai He
Kaixin Yao
Qixuan Zhang
Jingyi Yu
Lingjie Liu
Lan Xu
AI4CE
161
34
0
29 Jan 2024
Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models
Zhongjie Duan
Chengyu Wang
Cen Chen
Weining Qian
Jun Huang
DiffM
51
7
0
29 Jan 2024
Spatial-Aware Latent Initialization for Controllable Image Generation
Wenqiang Sun
Tengtao Li
Zehong Lin
Jun Zhang
94
11
0
29 Jan 2024
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
Shiyin Dong
Mingrui Zhu
Kun Cheng
Nannan Wang
Xinbo Gao
DiffM
42
3
0
29 Jan 2024
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
Xiaoyu Shi
Zhaoyang Huang
Fu-Yun Wang
Weikang Bian
Dasong Li
...
Ka Chun Cheung
Simon See
Hongwei Qin
Jifeng Da
Hongsheng Li
VGen
DiffM
131
93
0
29 Jan 2024
StableIdentity: Inserting Anybody into Anywhere at First Sight
Qinghe Wang
Xu Jia
Xiaomin Li
Taiqing Li
Liqian Ma
Yunzhi Zhuge
Huchuan Lu
79
21
0
29 Jan 2024
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models
Feihong He
Gang Li
Mengyuan Zhang
Leilei Yan
Hui Xiong
Fanzhang Li
Li Shen
DiffM
88
15
0
28 Jan 2024
IntentTuner: An Interactive Framework for Integrating Human Intents in Fine-tuning Text-to-Image Generative Models
Xingchen Zeng
Ziyao Gao
Yilin Ye
Wei Zeng
48
13
0
28 Jan 2024
DiffuserLite: Towards Real-time Diffusion Planning
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
182
20
0
27 Jan 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
128
27
0
27 Jan 2024
GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis
Jing Hao
Moyun Liu
Kuo Feng Hung
DiffM
59
2
0
27 Jan 2024
Annotated Hands for Generative Models
Yue Yang
Atith N Gandhi
Greg Turk
DiffM
GAN
33
3
0
26 Jan 2024
UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models
Timo Kapsalis
48
1
0
25 Jan 2024
Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation
Minglin Chen
Weihao Yuan
Yukun Wang
Zhe Sheng
Yisheng He
Zilong Dong
Liefeng Bo
Yulan Guo
DiffM
90
4
0
25 Jan 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
148
449
0
25 Jan 2024
Diffusion-based Data Augmentation for Object Counting Problems
Zhen Wang
Yuelei Li
Jia Wan
Nuno Vasconcelos
98
4
0
25 Jan 2024
Previous
1
2
3
...
42
43
44
...
60
61
62
Next