Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
A Survey on 3D Gaussian Splatting
Guikun Chen
Wenguan Wang
3DGS
241
193
0
08 Jan 2024
Deep Learning-based Image and Video Inpainting: A Survey
Weize Quan
Jiaxi Chen
Yanli Liu
Dong-Ming Yan
Peter Wonka
3DV
78
39
0
07 Jan 2024
Controllable Image Synthesis of Industrial Data Using Stable Diffusion
Gabriele Valvano
Antonino Agostino
Giovanni De Magistris
Antonino Graziano
Giacomo Veneri
63
8
0
06 Jan 2024
Generating Non-Stationary Textures using Self-Rectification
Yang Zhou
Rongjun Xiao
Dani Lischinski
Daniel Cohen-Or
Hui Huang
DiffM
109
4
0
05 Jan 2024
DiffBody: Diffusion-based Pose and Shape Editing of Human Images
Yuta Okuyama
Yuki Endo
Yoshihiro Kanamori
DiffM
75
4
0
05 Jan 2024
Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human
Song Bai
Jie Li
76
6
0
05 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
291
279
0
05 Jan 2024
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
E. Peruzzo
Vidit Goel
Dejia Xu
Xingqian Xu
Yi Ding
Zhangyang Wang
Humphrey Shi
N. Sebe
LM&Ro
VGen
DiffM
124
12
0
04 Jan 2024
LLaMA Pro: Progressive LLaMA with Block Expansion
Chengyue Wu
Yukang Gan
Yixiao Ge
Zeyu Lu
Jiahao Wang
Ye Feng
Ying Shan
Ping Luo
CLL
90
72
0
04 Jan 2024
Preserving Image Properties Through Initializations in Diffusion Models
Jeffrey Zhang
Shao-Yu Chang
Kedan Li
David Forsyth
DiffM
46
6
0
04 Jan 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu
Kelvin C. K. Chan
Yu-Chuan Su
Wenhu Chen
Yandong Li
...
Xue Ben
Boqing Gong
William W. Cohen
Ming-Wei Chang
Xuhui Jia
MLLM
139
50
0
03 Jan 2024
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
David Junhao Zhang
Dongxu Li
Hung Le
Mike Zheng Shou
Caiming Xiong
Doyen Sahoo
VGen
81
25
0
03 Jan 2024
WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
Jun-Yan He
Zhi-Qi Cheng
Chenyang Li
Jingdong Sun
Wangmeng Xiang
...
Zengke Jin
Bin Luo
Yifeng Geng
Xuansong Xie
Jingren Zhou
46
2
0
03 Jan 2024
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
Jan-Niklas Dihlmann
Andreas Engelhardt
Hendrik P. A. Lensch
DiffM
VGen
77
5
0
03 Jan 2024
ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text
Dingkun Yan
Liang Yuan
Erwin Wu
Yuma Nishioka
I. Fujishiro
Suguru Saito
DiffM
66
5
0
02 Jan 2024
VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics
Ammar A. Siddiqui
Santosh Tirunagari
Tehseen Zia
David Windridge
MedIm
91
1
0
02 Jan 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
80
22
0
02 Jan 2024
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Yifang Men
Biwen Lei
Yuan Yao
Miaomiao Cui
Zhouhui Lian
Xuansong Xie
SyDa
3DH
81
7
0
02 Jan 2024
Joint Generative Modeling of Scene Graphs and Images via Diffusion Models
Bicheng Xu
Qi Yan
Renjie Liao
Lele Wang
Leonid Sigal
DiffM
82
3
0
02 Jan 2024
Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation
Jinlong Xue
Yayue Deng
Yingming Gao
Ya Li
DiffM
100
36
0
02 Jan 2024
DiffMorph: Text-less Image Morphing with Diffusion Models
Shounak Chatterjee
DiffM
28
0
0
01 Jan 2024
Diffusion Models, Image Super-Resolution And Everything: A Survey
Brian B. Moser
Arundhati S. Shanbhag
Federico Raue
Stanislav Frolov
Sebastián M. Palacio
Andreas Dengel
108
41
0
01 Jan 2024
GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields
X. Pan
Zongxin Yang
Shuai Bai
Yi Yang
DiffM
OffRL
92
1
0
01 Jan 2024
A Generalist FaceX via Learning Unified Facial Representation
Yue Han
Jiangning Zhang
Junwei Zhu
Xiangtai Li
Yanhao Ge
Wei Li
Chengjie Wang
Yong Liu
Xiaoming Liu
Ying Tai
DiffM
104
13
0
31 Dec 2023
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
W. Ma
J. P. Lewis
W. Kleijn
DiffM
VGen
109
42
0
31 Dec 2023
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
180
11
0
31 Dec 2023
Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models
Han Jiang
Haosen Sun
Ruoxuan Li
Chi-Keung Tang
Yu-Wing Tai
DiffM
94
2
0
30 Dec 2023
P2M2-Net: Part-Aware Prompt-Guided Multimodal Point Cloud Completion
Linlian Jiang
Pan Chen
Ye Wang
Tieru Wu
Rui Ma
3DPC
70
0
0
29 Dec 2023
Discrete Distribution Networks
Lei Yang
141
1
0
29 Dec 2023
iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views
Chin-Hsuan Wu
Yen-Chun Chen
Bolivar Solarte
Lu Yuan
Min Sun
85
9
0
28 Dec 2023
Personalized Restoration via Dual-Pivot Tuning
Pradyumna Chari
Sizhuo Ma
Daniil Ostashev
A. Kadambi
Gurunandan Krishnan
Jian Wang
Kfir Aberman
DiffM
87
3
0
28 Dec 2023
EFHQ: Multi-purpose ExtremePose-Face-HQ dataset
T. Dao
D. Vu
Cuong Pham
Anh Tran
87
1
0
28 Dec 2023
DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors
Biwen Lei
Kai Yu
Mengyang Feng
Miaomiao Cui
Xuansong Xie
DiffM
82
17
0
28 Dec 2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu
Yuanfan Guo
Jianhua Han
Minzhe Niu
Yihan Zeng
Songcen Xu
Zeyi Huang
Zhao Zhong
Wei Zhang
Hang Xu
75
4
0
27 Dec 2023
One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
Mengyao Lyu
Yuhong Yang
Haiwen Hong
Hui Chen
Xuan Jin
Yuan He
Hui Xue
Jungong Han
Guiguang Ding
DiffM
111
67
0
26 Dec 2023
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis
Jingjing Ren
Cheng Xu
Haoyu Chen
Xinran Qin
Lei Zhu
CVBM
DiffM
99
4
0
26 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
117
69
0
26 Dec 2023
Semantic Guidance Tuning for Text-To-Image Diffusion Models
Hyun Kang
Dohae Lee
Myungjin Shin
In-Kwon Lee
51
1
0
26 Dec 2023
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Zhiwu Qing
Biao Gong
Yingya Zhang
Yujun Shen
Changxin Gao
Nong Sang
DiffM
VGen
108
28
0
25 Dec 2023
Towards Real-World Blind Face Restoration with Generative Diffusion Prior
Xiaoxu Chen
Jingfan Tan
Tao Wang
Kaihao Zhang
Wenhan Luo
Xiaocun Cao
DiffM
91
21
0
25 Dec 2023
High-Fidelity Diffusion-based Image Editing
Chen Hou
Guoqiang Wei
Zhibo Chen
DiffM
101
4
0
25 Dec 2023
Amodal Completion via Progressive Mixed Context Diffusion
Katherine Xu
Lingzhi Zhang
Jianbo Shi
DiffM
104
16
0
24 Dec 2023
Make-A-Character: High Quality Text-to-3D Character Generation within Minutes
Jianqiang Ren
Chao He
Lin Liu
Jiahao Chen
Yutong Wang
...
Siqi Hu
Tao Chen
Kunkun Zheng
Jianjing Xiang
Liefeng Bo
71
5
0
24 Dec 2023
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Xiaoyue Duan
Shuhao Cui
Guoliang Kang
Baochang Zhang
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
59
10
0
22 Dec 2023
Revisiting Few-Shot Object Detection with Vision-Language Models
Anish Madan
Neehar Peri
Shu Kong
Deva Ramanan
VLM
101
11
0
22 Dec 2023
UniHuman: A Unified Model for Editing Human Images in the Wild
Nannan Li
Qing Liu
Krishna Kumar Singh
Yilin Wang
Jianming Zhang
Bryan A. Plummer
Zhe Lin
54
10
0
22 Dec 2023
Diffusion Models for Generative Artificial Intelligence: An Introduction for Applied Mathematicians
Catherine F. Higham
Des J. Higham
P. Grindrod
DiffM
VLM
48
3
0
21 Dec 2023
PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar
Tzofi Klinghoffer
Xiaoyu Xiang
S. Somasundaram
Yuchen Fan
Christian Richardt
Ramesh Raskar
Rakesh Ranjan
84
7
0
21 Dec 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Jitesh Jain
Jianwei Yang
Humphrey Shi
MLLM
76
31
0
21 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
152
273
0
21 Dec 2023
Previous
1
2
3
...
44
45
46
...
60
61
62
Next