2302.05543
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
177
15
0
04 Dec 2023
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Yuchao Gu
Yipin Zhou
Bichen Wu
Licheng Yu
Jia-Wei Liu
Rui Zhao
Jay Zhangjie Wu
David Junhao Zhang
Mike Zheng Shou
Kevin Tang
DiffM
VGen
125
42
0
04 Dec 2023
UniGS: Unified Representation for Image Generation and Segmentation
Lu Qi
Lehan Yang
Weidong Guo
Yu-Syuan Xu
Bo Du
Varun Jampani
Ming-Hsuan Yang
94
19
0
04 Dec 2023
Instance-guided Cartoon Editing with a Large-scale Dataset
Jian Lin
Chengze Li
Xueting Liu
Zhongping Ge
50
0
0
04 Dec 2023
Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation
J. Niemeijer
Manuel Schwonberg
Jan-Aike Termöhlen
Nico M. Schmidt
Tim Fingscheidt
DiffM
76
19
0
04 Dec 2023
Collaborative Neural Painting
Nicola Dall’Asen
Willi Menapace
E. Peruzzo
E. Sangineto
Yiming Wang
Elisa Ricci
95
0
0
04 Dec 2023
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
L. Ran
Xiaodong Cun
Jia-Wei Liu
Rui Zhao
Song Zijie
Xintao Wang
Jussi Keppo
Mike Zheng Shou
91
12
0
04 Dec 2023
StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
Jeongho Kim
Gyojung Gu
Minho Park
S. Park
Jaegul Choo
DiffM
106
103
0
04 Dec 2023
GenEM: Physics-Informed Generative Cryo-Electron Microscopy
Jiakai Zhang
Qihe Chen
Yan Zeng
Wenyuan Gao
Xuming He
Zhijie Liu
Jingyi Yu
DiffM
58
2
0
04 Dec 2023
MedXChat: A Unified Multimodal Large Language Model Framework towards CXRs Understanding and Generation
Ling Yang
Zhanyu Wang
Zhenghao Chen
Xinyu Liang
Luping Zhou
LM&MA
MedIm
100
6
0
04 Dec 2023
Tracing Hyperparameter Dependencies for Model Parsing via Learnable Graph Pooling Network
Xiao Guo
Vishal Asnani
Sijia Liu
Xiaoming Liu
91
6
0
03 Dec 2023
Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models
Shengqu Cai
Duygu Ceylan
Matheus Gadelha
C. Huang
Tuanfeng Y. Wang
Gordon Wetzstein
VGen
99
18
0
03 Dec 2023
Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts
Tianqi Chen
Yongfei Liu
Zhendong Wang
Jianbo Yuan
Quanzeng You
Hongxia Yang
Mingyuan Zhou
VLM
79
6
0
03 Dec 2023
Meta ControlNet: Enhancing Task Adaptation via Meta Learning
Junjie Yang
Jinze Zhao
Peihao Wang
Zhangyang Wang
Yingbin Liang
126
3
0
03 Dec 2023
QPoser: Quantized Explicit Pose Prior Modeling for Controllable Pose Generation
Yumeng Li
Zhexu Luo
Zhong Ren
Kun Zhou
138
1
0
02 Dec 2023
Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty
Cheng-Fu Yang
Haoyang Xu
Te-Lin Wu
Xiaofeng Gao
Kai-Wei Chang
Feng Gao
DiffM
64
8
0
02 Dec 2023
Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D
Karran Pandey
Paul Guerrero
Matheus Gadelha
Yannick Hold-Geoffroy
Karan Singh
Niloy Mitra
DiffM
80
33
0
02 Dec 2023
Consistent Mesh Diffusion
Julian Knodt
Xifeng Gao
76
3
0
01 Dec 2023
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
Balamurugan Thambiraja
S. Aliakbarian
Darren Cosker
Justus Thies
DiffM
VGen
148
12
0
01 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGen
DiffM
66
0
0
01 Dec 2023
Text-Guided 3D Face Synthesis -- From Generation to Editing
Yunjie Wu
Yapeng Meng
Zhipeng Hu
Lincheng Li
Haoqian Wu
Kun Zhou
Weiwei Xu
Xin Yu
DiffM
130
10
0
01 Dec 2023
Lasagna: Layered Score Distillation for Disentangled Object Relighting
D. Bashkirova
Arijit Ray
Rupayan Mallick
Sarah Adel Bargal
Jianming Zhang
Ranjay Krishna
Kate Saenko
86
4
0
30 Nov 2023
ChatPose: Chatting about 3D Human Pose
Yao Feng
Jing Lin
Sai Kumar Dwivedi
Yu Sun
Priyanka Patel
Michael J. Black
3DH
91
42
0
30 Nov 2023
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing
Qi Dai
Zihao Zhang
Hui Zhang
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
102
17
0
30 Nov 2023
S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion
V. Kolmogorov
Rustem Takhanov
Dani Lischinski
DiffM
86
3
0
30 Nov 2023
ART⋅V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Wenming Weng
Ruoyu Feng
Yanhui Wang
Qi Dai
Chunyu Wang
...
Jianmin Bao
Yuhui Yuan
Chong Luo
Yueyi Zhang
Zhiwei Xiong
VGen
83
38
0
30 Nov 2023
Exploiting Diffusion Prior for Generalizable Dense Prediction
Hsin-Ying Lee
Hung-Yu Tseng
Hsin-Ying Lee
Ming-Hsuan Yang
DiffM
MDE
96
23
0
30 Nov 2023
MotionEditor: Editing Video Motion via Content-Aware Diffusion
Shuyuan Tu
Qi Dai
Zhi-Qi Cheng
Hang-Rui Hu
Xintong Han
Zuxuan Wu
Yu-Gang Jiang
DiffM
VGen
102
31
0
30 Nov 2023
Motion-Conditioned Image Animation for Video Editing
Wilson Yan
Andrew Brown
Pieter Abbeel
Rohit Girdhar
S. Azadi
DiffM
VGen
131
12
0
30 Nov 2023
CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation
Zineng Tang
Ziyi Yang
Mahmoud Khademi
Yang Liu
Chenguang Zhu
Mohit Bansal
LRM
MLLM
AuLLM
127
52
0
30 Nov 2023
Detailed Human-Centric Text Description-Driven Large Scene Synthesis
Gwanghyun Kim
Dong un Kang
H. Seo
Hayeon Kim
Se Young Chun
3DV
DiffM
61
2
0
30 Nov 2023
DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars
Tobias Kirschstein
Simon Giebenhain
Matthias Nießner
126
28
0
30 Nov 2023
Cancer-Net PCa-Gen: Synthesis of Realistic Prostate Diffusion Weighted Imaging Data via Anatomic-Conditional Controlled Latent Diffusion
Aditya Sridhar
Chi-en Amy Tai
Hayden Gunraj
Yuhao Chen
Alexander Wong
DiffM
MedIm
15
0
0
30 Nov 2023
CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model
Jianhao Zeng
Dan Song
Weizhi Nie
Hongshuo Tian
Tongtong Wang
Anan Liu
DiffM
88
27
0
30 Nov 2023
X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Yiwei Ma
Yijun Fan
Jiayi Ji
Haowei Wang
Xiaoshuai Sun
Guannan Jiang
Annan Shu
Rongrong Ji
94
7
0
30 Nov 2023
CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt
Haiyao Xiao
Chenglai Zhong
Xuan Gao
Yudong Guo
Juyong Zhang
73
0
0
30 Nov 2023
Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning
Ruxiao Duan
Yaoyao Liu
Jieneng Chen
Adam Kortylewski
Alan Yuille
DiffM
VLM
102
1
0
30 Nov 2023
Non-Cross Diffusion for Semantic Consistency
Ziyang Zheng
Ruiyuan Gao
Qiang Xu
DiffM
72
2
0
30 Nov 2023
HiPA: Enabling One-Step Text-to-Image Diffusion Models via High-Frequency-Promoting Adaptation
Yifan Zhang
Bryan Hooi
VLM
76
10
0
30 Nov 2023
Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models
Daniel Geng
Inbum Park
Andrew Owens
DiffM
96
30
0
29 Nov 2023
AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text
Jianfeng Zhang
Xuanmeng Zhang
Huichao Zhang
Jun Hao Liew
Chenxu Zhang
Yi Yang
Jiashi Feng
DiffM
108
16
0
29 Nov 2023
CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting
Alexander Vilesov
Pradyumna Chari
A. Kadambi
3DGS
75
35
0
29 Nov 2023
SODA: Bottleneck Diffusion Models for Representation Learning
Drew A. Hudson
Daniel Zoran
Mateusz Malinowski
Andrew Kyle Lampinen
Andrew Jaegle
James L. McClelland
Loic Matthey
Felix Hill
Alexander Lerchner
DiffM
106
56
0
29 Nov 2023
DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations
Maximilian Augustin
Yannic Neuhaus
Matthias Hein
DiffM
111
5
0
29 Nov 2023
Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
Chi-Pin Huang
Kai-Po Chang
Chung-Ting Tsai
Yung-Hsuan Lai
Fu-En Yang
Yu-Chiang Frank Wang
DiffM
105
56
0
29 Nov 2023
Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov
Amir Hertz
Moab Arar
Shlomi Fruchter
Daniel Cohen-Or
DiffM
51
5
0
29 Nov 2023
M²Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation
Xiaowei Chi
Rongyu Zhang
Zhengkai Jiang
Yijiang Liu
Ziyi Lin
...
Chaoyou Fu
Peng Gao
Shanghang Zhang
Qi-fei Liu
Yi-Ting Guo
MLLM
86
2
0
29 Nov 2023
SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Liang Peng
Haoran Cheng
Zheng Yang
Ruisi Zhao
Linxuan Xia
Chaotian Song
Qinglin Lu
Boxi Wu
Wei Liu
VGen
60
2
0
29 Nov 2023
Non-Visible Light Data Synthesis and Application: A Case Study for Synthetic Aperture Radar Imagery
Zichen Tian
Zhaozheng Chen
Qianru Sun
81
1
0
29 Nov 2023
When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation
Xiaoming Li
Xinyu Hou
Chen Change Loy
64
10
0
29 Nov 2023