ResearchTrend.AI
Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
Chuang Yang
Bingxuan Zhao
Qing Zhou
Qi Wang
183
3
0
18 Dec 2024
Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration
Xinlong Cheng
Tiantian Cao
Guoan Cheng
Bangxuan Huang
Xinghan Tian
Ye Wang
Xiaoyu He
Weixin Li
Tianfan Xue
Xuan Dong
DiffM
144
0
0
17 Dec 2024
Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Hao Li
Shamit Lal
Zhiheng Li
Yusheng Xie
Ying Wang
...
R. Manmatha
Zhuowen Tu
Stefano Ermon
Stefano Soatto
A. Swaminathan
144
1
0
16 Dec 2024
OmniPrism: Learning Disentangled Visual Concept for Image Generation
Yangyang Li
Daqing Liu
Wu Liu
Allen He
Xinchen Liu
Yongdong Zhang
Guoqing Jin
DiffM, CoGe
100
0
0
16 Dec 2024
IDEA-Bench: How Far are Generative Models from Professional Designing?
C. Liang
Lianghua Huang
Jingwu Fang
Huanzhang Dou
Wei Wang
Zhi-Fan Wu
Yupeng Shi
Junge Zhang
Xin Zhao
Yu Liu
3DV
142
1
0
16 Dec 2024
Generative Inbetweening through Frame-wise Conditions-Driven Video Generation
Tianyi Zhu
Dongwei Ren
Qilong Wang
Xiaohe Wu
W. Zuo
VGen
135
3
0
16 Dec 2024
StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors
Xiaokun Sun
Zeyu Cai
Zhenyu Zhang
Ying Tai
Jian Yang
133
0
0
16 Dec 2024
LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
Xi Wang
Haoyang Li
Heng Fang
Yichen Peng
H. Xie
Xi Yang
Chuntao Li
DiffM
110
1
0
16 Dec 2024
IGR: Improving Diffusion Model for Garment Restoration from Person Image
Le Shen
Rong Huang
Zhijie Wang
DiffM
162
2
0
16 Dec 2024
Wonderland: Navigating 3D Scenes from a Single Image
Hanwen Liang
Junli Cao
Vidit Goel
Guocheng Qian
Sergei Korolev
Demetri Terzopoulos
Konstantinos N. Plataniotis
Sergey Tulyakov
Jian Ren
VGen
208
14
0
16 Dec 2024
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
Zhibing Li
Tong Wu
Jing Tan
Mengchen Zhang
Jiaqi Wang
Dahua Lin
203
3
0
16 Dec 2024
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Dong In Lee
Hyeongcheol Park
Jiyoung Seo
Eunbyung Park
Hyunje Park
Ha Dam Baek
Shin Sangheon
Sangmin Kim
Sangpil Kim
3DGS
210
3
0
16 Dec 2024
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
Bingwen Hu
Heng Liu
Zhedong Zheng
Ping Liu
SupR
261
0
0
16 Dec 2024
ColorFlow: Retrieval-Augmented Image Sequence Colorization
Junhao Zhuang
Xuan Ju
Zhe Zhang
Yong-Jin Liu
Shiyi Zhang
Chun Yuan
Ying Shan
DiffM
177
1
0
16 Dec 2024
InterDyn: Controllable Interactive Dynamics with Video Diffusion Models
Rick Akkerman
Haiwen Feng
M. Black
Dimitrios Tzionas
Victoria Fernandez-Abrevaya
VGen, AI4CE
199
3
0
16 Dec 2024
Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models
Namhyuk Ahn
Kiyoon Yoo
Wonhyuk Ahn
Daesik Kim
Seung-Hun Nam
AAML, WIGM, DiffM
192
0
0
16 Dec 2024
UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models
Yuning Han
Bingyin Zhao
Rui Chu
Feng Luo
Biplab Sikdar
Yingjie Lao
DiffM, AAML
203
1
0
16 Dec 2024
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Mariam Hassan
Sebastian Stapf
Ahmad Rahimi
Pedro M B Rezende
Yasaman Haghighi
...
Mathieu Salzmann
Davide Scaramuzza
Marc Pollefeys
Paolo Favaro
Alexandre Alahi
VLM, VGen
142
12
0
15 Dec 2024
OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Bohan Li
Xin Jin
Jiadong Wang
Yukai Shi
Yasheng Sun
...
Zhuang Ma
Baao Xie
Chao Ma
Xiaokang Yang
Wenjun Zeng
DiffM
420
1
0
15 Dec 2024
SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
Zhaoyang Sun
Shengwu Xiong
Yaxiong Chen
Fei Du
Weihua Chen
Fan Wang
Yi Rong
DiffM
114
1
0
15 Dec 2024
GenLit: Reformulating Single-Image Relighting as Video Generation
Shrisha Bharadwaj
Haiwen Feng
Giorgio Becherini
Victoria Fernandez-Abrevaya
Michael J. Black
VGen
161
2
0
15 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen, DiffM
409
3
0
14 Dec 2024
EVLM: Self-Reflective Multimodal Reasoning for Cross-Dimensional Visual Editing
Umar Khalid
Hasan Iqbal
Azib Farooq
Nazanin Rahnavard
Jing Hua
Chen Chen
117
0
0
13 Dec 2024
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Jun Zheng
Jing Wang
Fuwei Zhao
Xujie Zhang
Xiaodan Liang
DiffM, VGen
123
0
0
13 Dec 2024
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Haonan Qiu
Shiwei Zhang
Yujie Wei
Ruihang Chu
Hangjie Yuan
Xinyu Wang
Yize Zhang
Ziwei Liu
165
4
0
12 Dec 2024
DECOR: Decomposition and Projection of Text Embeddings for Text-to-Image Customization
Geonhui Jang
Jin-Hwa Kim
Yong-Hyun Park
Junho Kim
Gayoung Lee
Yonghyun Jeong
DiffM
117
0
0
12 Dec 2024
MS2Mesh-XR: Multi-modal Sketch-to-Mesh Generation in XR Environments
Yuqi Tong
Yue Qiu
Ruiyang Li
Shi Qiu
Pheng-Ann Heng
VGen
140
0
0
12 Dec 2024
Olympus: A Universal Task Router for Computer Vision Tasks
Yuanze Lin
Yunsheng Li
Dongdong Chen
Weijian Xu
Ronald Clark
Philip Torr
VLM, ObjD
548
1
0
12 Dec 2024
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Guocheng Qian
Kuan-Chieh Wang
Or Patashnik
Negin Heravi
Daniil Ostashev
Sergey Tulyakov
Daniel Cohen-Or
Kfir Aberman
142
5
0
12 Dec 2024
GPTDrawer: Enhancing Visual Synthesis through ChatGPT
Kun Li
Xinwei Chen
Tianyou Song
Hansong Zhang
Wenzhe Zhang
Qing Shan
112
7
0
11 Dec 2024
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen
Zhifei Zhang
He Zhang
Yuqian Zhou
Seunggeun Kim
...
Nanxuan Zhao
Yilin Wang
Hui Ding
Zhe Lin
Hengshuang Zhao
VGen, DiffM
187
29
0
10 Dec 2024
StyleMaster: Stylize Your Video with Artistic Generation and Translation
Zixuan Ye
Huijuan Huang
Xintao Wang
Pengfei Wan
Di Zhang
Wenhan Luo
DiffM, VGen
132
6
0
10 Dec 2024
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Tong Wu
Yinghao Xu
Ryan Po
Mengchen Zhang
Guandao Yang
Jiaqi Wang
Ziqiang Liu
Dahua Lin
Gordon Wetzstein
113
0
0
10 Dec 2024
ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet
Andrei-Robert Alexandrescu
Razvan-Gabriel Petec
Alexandru Manole
Laura-Silvia Diosan
DiffM
112
0
0
09 Dec 2024
Generating floorplans for various building functionalities via latent diffusion model
Mohamed R. Ibrahim
J. Musil
Irene Gallou
DiffM, AI4CE
77
0
0
09 Dec 2024
Nested Diffusion Models Using Hierarchical Latent Priors
Xiao Zhang
Ruoxi Jiang
Rebecca Willett
Michael Maire
BDL, DiffM
118
1
0
08 Dec 2024
Birth and Death of a Rose
Chen Geng
Yunzhi Zhang
Shangzhe Wu
Jiajun Wu
AI4CE
118
2
0
06 Dec 2024
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
Yen-Ju Lu
Jing Liu
Thomas Thebaud
Laureano Moro-Velazquez
Ariya Rastrow
Najim Dehak
Jesus Villalba
135
1
0
05 Dec 2024
Structure-Aware Stylized Image Synthesis for Robust Medical Image Segmentation
Jie Bao
Zhixin Zhou
Wen Jung Li
Rui Luo
MedIm
121
0
0
05 Dec 2024
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Xinghui Li
Qichao Sun
Pengze Zhang
Fulong Ye
Zhichao Liao
Wanquan Feng
Mingcong Liu
Qian He
DiffM
137
3
0
05 Dec 2024
Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting
Guangben Lu
Yuzhen Du
Zhimin Sun
Ran Yi
Yifan Qi
Yizhe Tang
Tianyi Wang
Lizhuang Ma
Fangyuan Zou
DiffM
104
1
0
05 Dec 2024
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Yifan Lu
Xuanchi Ren
Jiawei Yang
Tianchang Shen
Zhangjie Wu
...
Yanjie Wang
Siheng Chen
Mike Chen
Sanja Fidler
Jiahui Huang
VGen
185
9
0
05 Dec 2024
Multi-view Image Diffusion via Coordinate Noise and Fourier Attention
Justin D. Theiss
Norman Müller
Daeil Kim
Aayush Prakash
102
0
0
04 Dec 2024
MV-Adapter: Multi-view Consistent Image Generation Made Easy
Zehuan Huang
Yu Guo
Haoran Wang
Ran Yi
Lizhuang Ma
Yan-Pei Cao
Lu Sheng
169
18
0
04 Dec 2024
Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention
Hannan Lu
Xiaohe Wu
Shudong Wang
Xiameng Qin
Xinyu Zhang
Junyu Han
W. Zuo
Ji Tao
141
2
0
04 Dec 2024
Skel3D: Skeleton Guided Novel View Synthesis
Aron Fóthi
Bence Fazekas
Natabara Máté Gyöngyössy
Kristian Fenech
131
0
0
04 Dec 2024
TASR: Timestep-Aware Diffusion Model for Image Super-Resolution
Qinwei Lin
Xiaopeng Sun
Yu Gao
Yujie Zhong
Dengjie Li
Zheng Zhao
Haoqian Wang
132
0
0
04 Dec 2024
DIVE: Taming DINO for Subject-Driven Video Editing
Yi Huang
Wei Xiong
He Zhang
Chaoqi Chen
Jianzhuang Liu
Mingfu Yan
Shifeng Chen
VGen, DiffM
119
1
0
04 Dec 2024
RFSR: Improving ISR Diffusion Models via Reward Feedback Learning
Xiaopeng Sun
Q. Lin
Yu Gao
Yujie Zhong
Chengjian Feng
Dengjie Li
Zheng Zhao
Jie Hu
Lin Ma
EGVM
119
1
0
04 Dec 2024
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Jiahao Lu
Tianyu Huang
Peng Li
Zhiyang Dou
Cheng Lin
Zhiming Cui
Z. Dong
Sai-Kit Yeung
Wenping Wang
Yuan Liu
VGen, MDE
200
13
0
04 Dec 2024