Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
105
6
0
24 Apr 2024
MatFusion: A Generative Diffusion Model for SVBRDF Capture
Sam Sartor
Pieter Peers
72
31
0
24 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
100
13
0
23 Apr 2024
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Xuanhua He
Quande Liu
Shengju Qian
Xin Eric Wang
Tao Hu
Ke Cao
K. Yan
Jie Zhang
VGen
109
50
0
23 Apr 2024
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Zehuan Huang
Hongxing Fan
Lipeng Wang
Lu Sheng
DiffM
83
11
0
23 Apr 2024
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Tianwei Zhou
Songbai Tan
Wei Zhou
Yu Luo
Yuan-Gen Wang
Guanghui Yue
EGVM
103
11
0
23 Apr 2024
Interactive Generation of Laparoscopic Videos with Diffusion Models
Ivan Iliash
Simeon Allmendinger
Felix Meissen
Niklas Kühl
Daniel Rückert
MedIm
VGen
134
6
0
23 Apr 2024
Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion
Hongyu Chen
Yi-Meng Gao
Min Zhou
Peng Wang
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
68
5
0
23 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
215
61
0
23 Apr 2024
Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses
Inhee Lee
Byungjun Kim
Hanbyul Joo
3DGS
104
6
0
22 Apr 2024
ControlMol: Adding Substruture Control To Molecule Diffusion Models
Zhengyang Qi
Zijing Liu
Jiying Zhang
He Cao
Yu-Feng Li
73
2
0
22 Apr 2024
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas
Jia Wei Sii
Chee Seng Chan
DiffM
103
0
0
22 Apr 2024
Towards Better Text-to-Image Generation Alignment via Attention Modulation
Yihang Wu
Xiao Cao
Kaixin Li
Zitan Chen
Haonan Wang
Lei Meng
Zhiyong Huang
DiffM
104
5
0
22 Apr 2024
ColA: Collaborative Adaptation with Gradient Learning
Enmao Diao
Qi Le
Suya Wu
Xinran Wang
Ali Anwar
Jie Ding
Vahid Tarokh
68
1
0
22 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
241
29
0
22 Apr 2024
Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions
Steven A. Grosz
Anil K. Jain
94
3
0
21 Apr 2024
Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Maria Mihaela Truşcǎ
Wolf Nuyts
Jonathan Thomm
Robert Honig
Thomas Hofmann
Tinne Tuytelaars
Marie-Francine Moens
42
5
0
21 Apr 2024
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Yuxi Ren
Xin Xia
Yanzuo Lu
Jiacheng Zhang
Jie Wu
Pan Xie
Xing Wang
Xuefeng Xiao
167
79
0
21 Apr 2024
Zero-shot High-fidelity and Pose-controllable Character Animation
Bingwen Zhu
Fanyi Wang
Tianyi Lu
Peng Liu
Jingwen Su
Yu Lei
Yanhao Zhang
Zuxuan Wu
Guo-Jun Qi
Yu-Gang Jiang
DiffM
VGen
100
6
0
21 Apr 2024
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
Xiaoran Zhao
Tianhao Wu
Yu Lai
Zhiliang Tian
Zhen Huang
Yahui Liu
Zejiang He
Dongsheng Li
DiffM
116
1
0
21 Apr 2024
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation
Gensheng Pei
Yazhou Yao
Jianbo Jiao
Wenguan Wang
Liqiang Nie
Jinhui Tang
VOS
98
1
0
21 Apr 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Haoyu Zheng
Wenqiao Zhang
Yaoke Wang
Hao Zhou
Jiang Liu
Juncheng Li
Zheqi Lv
Siliang Tang
Yueting Zhuang
Yueting Zhuang
138
2
0
21 Apr 2024
Music Consistency Models
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
106
5
0
20 Apr 2024
Generating Daylight-driven Architectural Design via Diffusion Models
Pengzhi Li
Baijuan Li
AI4CE
DiffM
70
12
0
20 Apr 2024
FilterPrompt: Guiding Image Transfer in Diffusion Models
Xi Wang
Yichen Peng
Heng Fang
Haoran Xie
Xi Yang
Chuntao Li
DiffM
82
0
0
20 Apr 2024
MCM: Multi-condition Motion Synthesis Framework
Zeyu Ling
Bo Han
Yongkang Wang
Han Lin
Mohan Kankanhalli
Weidong Geng
72
0
0
19 Apr 2024
Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models
Georges Le Bellier
Nicolas Audebert
75
8
0
19 Apr 2024
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models
Sai Sree Harsha
Ambareesh Revanur
Dhwanit Agarwal
Shradha Agrawal
VGen
DiffM
68
4
0
18 Apr 2024
Diff-Control: A Stateful Diffusion-based Policy for Imitation Learning
Xiao Liu
Yifan Zhou
F. Weigend
Shubham D. Sonawani
Shuhei Ikemoto
H. B. Amor
67
1
0
18 Apr 2024
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu
Taesung Park
Richard Zhang
Yang Zhou
Eli Shechtman
Feng Liu
Jia-Bin Huang
Difan Liu
SupR
143
12
0
18 Apr 2024
RoboDreamer: Learning Compositional World Models for Robot Imagination
Siyuan Zhou
Yilun Du
Jiaben Chen
Yandong Li
Dit-Yan Yeung
Chuang Gan
VGen
LM&Ro
150
45
0
18 Apr 2024
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Nupur Kumari
Grace Su
Richard Zhang
Taesung Park
Eli Shechtman
Jun-Yan Zhu
DiffM
92
5
0
18 Apr 2024
Reducing Bias in Pre-trained Models by Tuning while Penalizing Change
Niklas Penzel
Gideon Stein
Joachim Denzler
50
0
0
18 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
91
9
0
18 Apr 2024
Sketch-guided Image Inpainting with Partial Discrete Diffusion Process
Nakul Sharma
Aditay Tripathi
Anirban Chakraborty
Anand Mishra
DiffM
92
3
0
18 Apr 2024
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
Tianyi Liang
Jiangqi Liu
Sicheng Song
Shiqi Jiang
Yifei Huang
Changbo Wang
Chenhui Li
183
0
0
18 Apr 2024
Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Daniel Geng
Inbum Park
Andrew Owens
DiffM
150
16
0
17 Apr 2024
Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt
Zhanjie Zhang
Quanwei Zhang
Huaizhong Lin
Wei Xing
Juncheng Mo
...
Guangyuan Li
Junsheng Luan
Lei Zhao
Dalong Zhang
Lixia Chen
DiffM
108
14
0
17 Apr 2024
Single-temporal Supervised Remote Change Detection for Domain Generalization
Qiangang Du
Jinlong Peng
Xu Chen
Qingdong He
Liren He
Qiang Nie
Wenbing Zhu
Mingmin Chi
Yabiao Wang
Chengjie Wang
91
1
0
17 Apr 2024
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Kuo-Chin Lien
Misha Sra
Pradeep Sen
65
4
0
17 Apr 2024
Generating Human Interaction Motions in Scenes with Text Control
Hongwei Yi
Justus Thies
Michael J. Black
Xue Bin Peng
Davis Rempe
VGen
DiffM
100
47
0
16 Apr 2024
StyleCity: Large-Scale 3D Urban Scenes Stylization
Yingshu Chen
Huajian Huang
Tuan-Anh Vu
Ka Chun Shum
Sai-Kit Yeung
87
0
0
16 Apr 2024
EucliDreamer: Fast and High-Quality Texturing for 3D Models with Depth-Conditioned Stable Diffusion
Cindy X. Le
Congrui Hetang
Chendi Lin
Ang Cao
Yihui He
68
0
0
16 Apr 2024
Salient Object-Aware Background Generation using Text-Guided Diffusion Models
Amir Erfan Eshratifar
JOÃO-BRUNO Soares
K. Thadani
Shaunak Mishra
Mikhail Kuznetsov
Yueh-Ning Ku
P.De Juan
DiffM
117
4
0
15 Apr 2024
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
MoMe
88
7
0
15 Apr 2024
Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
91
1
0
15 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
159
28
0
15 Apr 2024
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models
Ziwei Luo
Fredrik K. Gustafsson
Zheng Zhao
Jens Sjölund
Thomas B. Schön
VLM
94
14
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li Song
Wenjun Zhang
Zhiwu Huang
MLLM
74
0
0
15 Apr 2024
Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement
Chi-Yin Wang
Junming Huang
Rong Zhang
Qi Wang
Haotian Yang
Haibin Huang
Chongyang Ma
Weiwei Xu
3DH
77
2
0
15 Apr 2024
Previous
1
2
3
...
34
35
36
...
60
61
62
Next