ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 367 papers shown
Title
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Dewei Zhou
Mingwei Li
Zongxin Yang
Yi Yang
176
3
0
17 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-Jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Yue Yang
186
2
0
16 Mar 2025
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
Zijian He
Yuwei Ning
Yipeng Qin
Wangrun Wang
Sibei Yang
Liang Lin
G. Li
146
2
0
15 Mar 2025
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
207
4
0
14 Mar 2025
Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption
Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption
Du Chen
Tianhe Wu
Kede Ma
Lei Zhang
68
3
0
14 Mar 2025
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Hanyang Zhao
Haoxian Chen
Yucheng Guo
Genta Indra Winata
Tingting Ou
Ziyu Huang
D. Yao
Wenpin Tang
112
0
0
13 Mar 2025
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Junsong Chen
Shuchen Xue
Yuyang Zhao
Jincheng Yu
Sayak Paul
Junyu Chen
Han Cai
Enze Xie
Enze Xie
VLM
123
9
0
12 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
471
1
0
12 Mar 2025
Preference-Based Alignment of Discrete Diffusion Models
Preference-Based Alignment of Discrete Diffusion Models
Umberto Borso
Davide Paglieri
Jude Wells
Tim Rocktaschel
104
3
0
11 Mar 2025
TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement
Miao Zhang
Jun Yin
Pengyu Zeng
Yiqing Shen
Shuai Lu
Xueqian Wang
DiffM
155
13
0
11 Mar 2025
Temporal Triplane Transformers as Occupancy World Models
Temporal Triplane Transformers as Occupancy World Models
Haoran Xu
Peixi Peng
Guang Tan
Yiqian Chang
Yisen Zhao
Yonghong Tian
158
0
0
10 Mar 2025
Consistent Image Layout Editing with Diffusion Models
Tao Xia
Yudi Zhang
Ting Liu Lei Zhang
DiffM
114
1
0
09 Mar 2025
D3DR: Lighting-Aware Object Insertion in Gaussian Splatting
Vsevolod Skorokhodov
Nikita Durasov
Pascal Fua
3DGS
86
0
0
09 Mar 2025
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
Jian Ma
Qirong Peng
Xu Guo
Chen Chen
H. Lu
Zhenyu Yang
VLM
128
1
0
08 Mar 2025
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
Xiang Gao
Shuai Yang
Jiaying Liu
DiffM
102
0
0
08 Mar 2025
All-atom Diffusion Transformers: Unified generative modelling of molecules and materials
All-atom Diffusion Transformers: Unified generative modelling of molecules and materials
Chaitanya K. Joshi
Xiang Fu
Yi-Lun Liao
Vahe Gharakhanyan
Benjamin Kurt Miller
Anuroop Sriram
Zachary W. Ulissi
DiffM
175
8
0
05 Mar 2025
Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization
Jamie Wynn
Z. Qureshi
Jakub Powierza
Jamie Watson
Mohamed Sayed
3DGSDiffM
141
1
0
03 Mar 2025
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Antoni Bigata
Michał Stypułkowski
Rodrigo Mira
Stella Bounareli
Konstantinos Vougioukas
Zoe Landgraf
Nikita Drobyshev
Maciej Ziȩba
Stavros Petridis
Maja Pantic
DiffMVGen
122
2
0
03 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
82
0
0
02 Mar 2025
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
Yifei Qian
Zhongliang Guo
Bowen Deng
Chun Tong Lei
Shuai Zhao
Chun Pong Lau
Xiaopeng Hong
Michael P. Pound
DiffM
155
1
0
28 Feb 2025
Knowledge Bridger: Towards Training-free Missing Modality Completion
Knowledge Bridger: Towards Training-free Missing Modality Completion
Guanzhou Ke
Shengfeng He
Xinyu Wang
Bo Wang
Guoqing Chao
Yize Zhang
Yi Xie
HeXing Su
153
1
0
27 Feb 2025
GCC: Generative Color Constancy via Diffusing a Color Checker
GCC: Generative Color Constancy via Diffusing a Color Checker
Chen-Wei Chang
Cheng-De Fan
Chia-Che Chang
Yi-Chen Lo
Yu-Chee Tseng
Jiun-Long Huang
Yu-Lun Liu
142
0
0
24 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Yongqian Li
Gordon Wetzstein
Ziwei Liu
Dahua Lin
MDEVGen
142
8
0
24 Feb 2025
A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis
A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis
Yuli Wu
Fucheng Liu
Rüveyda Yilmaz
Henning Konermann
Peter Walter
Johannes Stegmaier
EGVMMedIm
104
2
0
24 Feb 2025
HumanGif: Single-View Human Diffusion with Generative Prior
HumanGif: Single-View Human Diffusion with Generative Prior
Shoukang Hu
Takuya Narihira
Kazumi Fukuda
Ryosuke Sawata
Takashi Shibuya
Yuki Mitsufuji
166
2
0
24 Feb 2025
3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation
3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation
Hansheng Chen
Bokui Shen
Yulin Liu
Ruoxi Shi
Linqi Zhou
Connor Z. Lin
Jiayuan Gu
H. Su
Gordon Wetzstein
Leonidas Guibas
163
3
0
21 Feb 2025
Image compositing is all you need for data augmentation
Image compositing is all you need for data augmentation
Ang Jia Ning Shermaine
Michalis Lazarou
Tania Stathaki
151
2
0
20 Feb 2025
FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion
FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion
Yufan Zhou
Haoyu Shen
Huan Wang
DiffM
203
1
0
17 Feb 2025
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Tai-Yu Pan
Sooyoung Jeon
Mengdi Fan
Jinsu Yoo
Zhenyang Feng
Mark E. Campbell
Kilian Q. Weinberger
Bharath Hariharan
Wei-Lun Chao
199
0
0
10 Feb 2025
MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models
MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models
Kamil Garifullin
Maxim Nikolaev
Andrey Kuznetsov
Aibek Alanov
98
0
0
10 Feb 2025
Beyond and Free from Diffusion: Invertible Guided Consistency Training
Chia-Hong Hsu
Shiu-hong Kao
Randall Balestriero
3DV
122
0
0
08 Feb 2025
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Minh-Quan Le
Gaurav Mittal
Tianjian Meng
A S M Iftekhar
Vishwas Suryanarayanan
Barun Patra
Dimitris Samaras
Mei Chen
DiffM
116
0
0
07 Feb 2025
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
Jiajian Li
Jingyun Liang
Yong Guo
Wenbo Li
Yulun Zhang
DiffM
152
2
0
04 Feb 2025
Improved Training Technique for Latent Consistency Models
Improved Training Technique for Latent Consistency Models
Quan Dao
Khanh Doan
Di Liu
Trung Le
Dimitris N. Metaxas
112
3
0
03 Feb 2025
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
Aashish Rai
Dilin Wang
Mihir Jain
N. Sarafianos
Arthur Chen
Srinath Sridhar
Aayush Prakash
3DGS
149
1
0
03 Feb 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
157
7
0
28 Jan 2025
Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds
Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds
Xiaoyu Xiang
Liat Sless Gorelik
Yuchen Fan
Omri Armstrong
Forrest N. Iandola
Yilei Li
Ita Lifshitz
Rakesh Ranjan
3DGSDiffM
165
5
0
28 Jan 2025
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng
Weikang Qiu
Jinbin Bai
Xiao Zhang
Zhen Dong
Kaicheng Zhou
Rex Ying
Leandros Tassiulas
DiffM
106
6
0
28 Jan 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
100
1
0
28 Jan 2025
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
Andrey Palaev
Adil Mehmood Khan
S. M. Ahsan Kazmi
DiffM
110
0
0
23 Jan 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
195
11
0
23 Jan 2025
Survey on Monocular Metric Depth Estimation
Survey on Monocular Metric Depth Estimation
Jiuling Zhang
VLM
217
0
0
21 Jan 2025
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Zibo Zhao
Zeqiang Lai
Qingxiang Lin
Yunfei Zhao
Haolin Liu
...
Jingwei Huang
Chunchao Guo
Jie Jiang
Jingwei Huang
Chunchao Guo
236
39
0
21 Jan 2025
Ditto: Accelerating Diffusion Model via Temporal Value Similarity
Ditto: Accelerating Diffusion Model via Temporal Value Similarity
Sungbin Kim
Hyunwuk Lee
Wonho Cho
Mincheol Park
Won Woo Ro
131
1
0
20 Jan 2025
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
Ruojun Xu
Weijie Xi
Xiaodi Wang
Yongbo Mao
Zach Cheng
DiffM
99
1
0
20 Jan 2025
Lossy Compression with Pretrained Diffusion Models
Jeremy Vonderfecht
Feng Liu
DiffM
138
2
0
20 Jan 2025
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
125
4
0
20 Jan 2025
Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style
Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style
Haohan Wang
Wei Feng
Yang Lu
Yaoyu Li
Zheng Zhang
Jingjing Lv
Xin Zhu
Jun-Jun Shen
DiffM
148
5
0
20 Jan 2025
Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer
Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer
Siyuan Hou
Shansong Liu
Ruibin Yuan
Wei Xue
Ying Shan
Mangsuo Zhao
Chao Zhang
128
5
0
17 Jan 2025
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Yong-Hyun Park
Sangdoo Yun
Jin-Hwa Kim
Junho Kim
Geonhui Jang
Yonghyun Jeong
Junghyo Jo
Gayoung Lee
135
17
0
17 Jan 2025
Previous
12345678
Next