Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
A Comprehensive Review on Noise Control of Diffusion Model
Zhehao Guo
Jiedong Lang
Shuyu Huang
Yunfei Gao
Xintong Ding
DiffM
77
0
0
07 Feb 2025
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Minh-Quan Le
Gaurav Mittal
Tianjian Meng
A S M Iftekhar
Vishwas Suryanarayanan
Barun Patra
Dimitris Samaras
Mei Chen
DiffM
133
0
0
07 Feb 2025
DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models
Lingshun Kong
Jiawei Zhang
Dongqing Zou
Jimmy S. J. Ren
Xiaohe Wu
Jiangxin Dong
Jinshan Pan
DiffM
110
0
0
06 Feb 2025
DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation
Luciano Baresi
Davide Yi Xian Hu
Muhammad Irfan Masúdi
G. Quattrocchi
DiffM
VLM
167
1
0
05 Feb 2025
Recommendations Beyond Catalogs: Diffusion Models for Personalized Generation
Gabriel Patron
Zhiwei Xu
Ishan Kapnadak
Felipe Maia Polo
DiffM
74
1
0
05 Feb 2025
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control
Xianghui Ze
Zhenbo Song
Qiwei Wang
Jianfeng Lu
Yujiao Shi
106
1
0
05 Feb 2025
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
Jiajian Li
Jingyun Liang
Yong Guo
Wenbo Li
Yulun Zhang
DiffM
191
3
0
04 Feb 2025
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
Aashish Rai
Dilin Wang
Mihir Jain
N. Sarafianos
Arthur Chen
Srinath Sridhar
Aayush Prakash
3DGS
184
1
0
03 Feb 2025
Improved Training Technique for Latent Consistency Models
Quan Dao
Khanh Doan
Di Liu
Trung Le
Dimitris N. Metaxas
161
3
0
03 Feb 2025
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
Rohit Gandikota
Zongze Wu
Richard Zhang
David Bau
Eli Shechtman
Nick Kolkin
DiffM
83
2
0
03 Feb 2025
Unpaired Deblurring via Decoupled Diffusion Model
Junhao Cheng
Wei-Ting Chen
Xi Lu
Ming-Hsuan Yang
DiffM
131
0
0
03 Feb 2025
Assessing the use of Diffusion models for motion artifact correction in brain MRI
Paolo Angella
Vito Paolo Pastore
Matteo Santacesaria
MedIm
DiffM
98
1
0
03 Feb 2025
Weak Supervision Dynamic KL-Weighted Diffusion Models Guided by Large Language Models
Julian Perry
Frank Sanders
Carter Scott
113
0
0
02 Feb 2025
Position: AI Scaling: From Up to Down and Out
Yunke Wang
Yanxi Li
Chang Xu
HAI
224
1
0
02 Feb 2025
Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior
Tongda Xu
Xiyan Cai
Wei Wei
Xingtong Ge
Dailan He
Ming Sun
Jingjing Liu
Yuanhang Zhang
Jian Li
Yan Wang
DiffM
213
3
0
31 Jan 2025
VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback
Sayeh Gholipour Picha
D. Chanti
A. Caplier
MedIm
117
0
0
29 Jan 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
172
9
0
28 Jan 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
119
1
0
28 Jan 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Chufan Chen
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
114
4
0
28 Jan 2025
Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds
Xiaoyu Xiang
Liat Sless Gorelik
Yuchen Fan
Omri Armstrong
Forrest N. Iandola
Yilei Li
Ita Lifshitz
Rakesh Ranjan
3DGS
DiffM
183
5
0
28 Jan 2025
Adversarially Robust Out-of-Distribution Detection Using Lyapunov-Stabilized Embeddings
Hossein Mirzaei
Mackenzie W. Mathis
OODD
AAML
129
4
0
28 Jan 2025
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng
Weikang Qiu
Jinbin Bai
Xiao Zhang
Zhen Dong
Kaicheng Zhou
Rex Ying
Leandros Tassiulas
DiffM
122
6
0
28 Jan 2025
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
Andrey Palaev
Adil Mehmood Khan
S. M. Ahsan Kazmi
DiffM
120
0
0
23 Jan 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
234
12
0
23 Jan 2025
3D Object Manipulation in a Single Image using Generative Models
Ruisi Zhao
Zechuan Zhang
Zongxin Yang
Yi Yang
99
1
0
22 Jan 2025
PreciseCam: Precise Camera Control for Text-to-Image Generation
Edurne Bernal-Berdun
Ana Serrano
B. Masiá
Matheus Gadelha
Yannick Hold-Geoffroy
Xin Sun
Diego F. F. Gutierrez
DiffM
VGen
102
1
0
22 Jan 2025
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Zibo Zhao
Zeqiang Lai
Qingxiang Lin
Yunfei Zhao
Haolin Liu
...
Jingwei Huang
Chunchao Guo
Jie Jiang
Jingwei Huang
Chunchao Guo
263
45
0
21 Jan 2025
Survey on Monocular Metric Depth Estimation
Jiuling Zhang
VLM
277
1
0
21 Jan 2025
Ditto: Accelerating Diffusion Model via Temporal Value Similarity
Sungbin Kim
Hyunwuk Lee
Wonho Cho
Mincheol Park
Won Woo Ro
153
1
0
20 Jan 2025
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
Ruojun Xu
Weijie Xi
Xiaodi Wang
Yongbo Mao
Zach Cheng
DiffM
120
1
0
20 Jan 2025
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
160
4
0
20 Jan 2025
Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style
Haohan Wang
Wei Feng
Yang Lu
Yaoyu Li
Zheng Zhang
Jingjing Lv
Xin Zhu
Jun-Jun Shen
DiffM
179
5
0
20 Jan 2025
Lossy Compression with Pretrained Diffusion Models
Jeremy Vonderfecht
Feng Liu
DiffM
153
2
0
20 Jan 2025
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Yong-Hyun Park
Sangdoo Yun
Jin-Hwa Kim
Junho Kim
Geonhui Jang
Yonghyun Jeong
Junghyo Jo
Gayoung Lee
167
19
0
17 Jan 2025
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation
Hwan Heo
Jangyeong Kim
Seongyeong Lee
Jeong A Wi
Junyoung Choi
Sangjun Ahn
111
0
0
17 Jan 2025
Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer
Siyuan Hou
Shansong Liu
Ruibin Yuan
Wei Xue
Ying Shan
Mangsuo Zhao
Chao Zhang
145
6
0
17 Jan 2025
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
Sumit Chaturvedi
Mengwei Ren
Yannick Hold-Geoffroy
Jingyuan Liu
Julie Dorsey
Zhixin Shu
DiffM
99
0
0
17 Jan 2025
Enhanced Multi-Scale Cross-Attention for Person Image Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
DiffM
145
0
0
15 Jan 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DH
MDE
133
1
0
15 Jan 2025
Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation
Ahmad Süleyman
Göksel Biricik
87
2
0
15 Jan 2025
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training
Xingyi He He
Hao Yu
Sida Peng
Dongli Tan
Zehong Shen
Hujun Bao
Xiaowei Zhou
114
6
0
13 Jan 2025
IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion
Tharun Anand
Aryan Garg
Kaushik Mitra
VGen
DiffM
90
0
0
13 Jan 2025
Enhancing Image Generation Fidelity via Progressive Prompts
Zhen Xiong
Yuqi Li
Chuanguang Yang
Tiao Tan
Zhihong Zhu
Siyuan Li
Yue Ma
84
4
0
13 Jan 2025
Introducing 3D Representation for Medical Image Volume-to-Volume Translation via Score Fusion
Xiyue Zhu
Dou Hoon Kwark
Ruike Zhu
Kaiwen Hong
Yiqi Tao
Shirui Luo
Yudu Li
Zhi-Pei Liang
Volodymyr V. Kindratenko
MedIm
100
0
0
13 Jan 2025
Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning
Maomao Li
Lijian Lin
Yunfei Liu
Ye Zhu
Yu Li
DiffM
VGen
113
0
0
11 Jan 2025
HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection
Anant Mehta
Bryant McArthur
Nagarjuna Kolloju
Zhengzhong Tu
96
0
0
10 Jan 2025
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Minxing Luo
Zixun Xia
L. Chen
Zhenhang Li
Weichao Zeng
Jinqiao Wang
Wentao Cheng
Yaxing Wang
Yu Zhou
Jian Yang
DiffM
149
1
0
10 Jan 2025
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
Ruben Ciranni
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Giorgio Fabbro
Emanuele Rodolà
Luca Cosmo
278
8
0
10 Jan 2025
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control
Mengting Wei
Tuomas Varanka
Xingxun Jiang
Huai-Qian Khor
Guoying Zhao
DiffM
98
0
0
10 Jan 2025
Generative AI for Cel-Animation: A Survey
Yunlong Tang
Junjia Guo
Pinxin Liu
Zhiyuan Wang
Hang Hua
...
Jing Bi
Mingqian Feng
Xuzhao Li
Zeliang Zhang
Chenliang Xu
VGen
163
7
0
08 Jan 2025
Previous
1
2
3
...
13
14
15
...
60
61
62
Next