ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion
  Models
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
105
6
0
24 Apr 2024
MatFusion: A Generative Diffusion Model for SVBRDF Capture
MatFusion: A Generative Diffusion Model for SVBRDF Capture
Sam Sartor
Pieter Peers
72
31
0
24 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with
  Reward Feedback Learning
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
100
13
0
23 Apr 2024
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Xuanhua He
Quande Liu
Shengju Qian
Xin Eric Wang
Tao Hu
Ke Cao
K. Yan
Jie Zhang
VGen
109
50
0
23 Apr 2024
From Parts to Whole: A Unified Reference Framework for Controllable
  Human Image Generation
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Zehuan Huang
Hongxing Fan
Lipeng Wang
Lu Sheng
DiffM
83
11
0
23 Apr 2024
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image
  Quality Assessment
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Tianwei Zhou
Songbai Tan
Wei Zhou
Yu Luo
Yuan-Gen Wang
Guanghui Yue
EGVM
103
11
0
23 Apr 2024
Interactive Generation of Laparoscopic Videos with Diffusion Models
Interactive Generation of Laparoscopic Videos with Diffusion Models
Ivan Iliash
Simeon Allmendinger
Felix Meissen
Niklas Kühl
Daniel Rückert
MedImVGen
134
6
0
23 Apr 2024
Enhancing Prompt Following with Visual Control Through Training-Free
  Mask-Guided Diffusion
Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion
Hongyu Chen
Yi-Meng Gao
Min Zhou
Peng Wang
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
68
5
0
23 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
215
61
0
23 Apr 2024
Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D
  Glimpses
Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses
Inhee Lee
Byungjun Kim
Hanbyul Joo
3DGS
104
6
0
22 Apr 2024
ControlMol: Adding Substruture Control To Molecule Diffusion Models
ControlMol: Adding Substruture Control To Molecule Diffusion Models
Zhengyang Qi
Zijing Liu
Jiying Zhang
He Cao
Yu-Feng Li
73
2
0
22 Apr 2024
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas
Jia Wei Sii
Chee Seng Chan
DiffM
103
0
0
22 Apr 2024
Towards Better Text-to-Image Generation Alignment via Attention
  Modulation
Towards Better Text-to-Image Generation Alignment via Attention Modulation
Yihang Wu
Xiao Cao
Kaixin Li
Zitan Chen
Haonan Wang
Lei Meng
Zhiyong Huang
DiffM
104
5
0
22 Apr 2024
ColA: Collaborative Adaptation with Gradient Learning
ColA: Collaborative Adaptation with Gradient Learning
Enmao Diao
Qi Le
Suya Wu
Xinran Wang
Ali Anwar
Jie Ding
Vahid Tarokh
68
1
0
22 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
241
29
0
22 Apr 2024
Universal Fingerprint Generation: Controllable Diffusion Model with
  Multimodal Conditions
Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions
Steven A. Grosz
Anil K. Jain
94
3
0
21 Apr 2024
Object-Attribute Binding in Text-to-Image Generation: Evaluation and
  Control
Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Maria Mihaela Truşcǎ
Wolf Nuyts
Jonathan Thomm
Robert Honig
Thomas Hofmann
Tinne Tuytelaars
Marie-Francine Moens
42
5
0
21 Apr 2024
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image
  Synthesis
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Yuxi Ren
Xin Xia
Yanzuo Lu
Jiacheng Zhang
Jie Wu
Pan Xie
Xing Wang
Xuefeng Xiao
167
79
0
21 Apr 2024
Zero-shot High-fidelity and Pose-controllable Character Animation
Zero-shot High-fidelity and Pose-controllable Character Animation
Bingwen Zhu
Fanyi Wang
Tianyi Lu
Peng Liu
Jingwen Su
Yu Lei
Yanhao Zhang
Zuxuan Wu
Guo-Jun Qi
Yu-Gang Jiang
DiffMVGen
100
6
0
21 Apr 2024
LTOS: Layout-controllable Text-Object Synthesis via Adaptive
  Cross-attention Fusions
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
Xiaoran Zhao
Tianhao Wu
Yu Lai
Zhiliang Tian
Zhen Huang
Yahui Liu
Zejiang He
Dongsheng Li
DiffM
116
1
0
21 Apr 2024
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised
  Video Object Segmentation
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation
Gensheng Pei
Yazhou Yao
Jianbo Jiao
Wenguan Wang
Liqiang Nie
Jinhui Tang
VOS
98
1
0
21 Apr 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Haoyu Zheng
Wenqiao Zhang
Yaoke Wang
Hao Zhou
Jiang Liu
Juncheng Li
Zheqi Lv
Siliang Tang
Yueting Zhuang
Yueting Zhuang
138
2
0
21 Apr 2024
Music Consistency Models
Music Consistency Models
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
106
5
0
20 Apr 2024
Generating Daylight-driven Architectural Design via Diffusion Models
Generating Daylight-driven Architectural Design via Diffusion Models
Pengzhi Li
Baijuan Li
AI4CEDiffM
70
12
0
20 Apr 2024
FilterPrompt: Guiding Image Transfer in Diffusion Models
FilterPrompt: Guiding Image Transfer in Diffusion Models
Xi Wang
Yichen Peng
Heng Fang
Haoran Xie
Xi Yang
Chuntao Li
DiffM
82
0
0
20 Apr 2024
MCM: Multi-condition Motion Synthesis Framework
MCM: Multi-condition Motion Synthesis Framework
Zeyu Ling
Bo Han
Yongkang Wang
Han Lin
Mohan Kankanhalli
Weidong Geng
72
0
0
19 Apr 2024
Detecting Out-Of-Distribution Earth Observation Images with Diffusion
  Models
Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models
Georges Le Bellier
Nicolas Audebert
75
8
0
19 Apr 2024
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I
  Diffusion Models
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models
Sai Sree Harsha
Ambareesh Revanur
Dhwanit Agarwal
Shradha Agrawal
VGenDiffM
68
4
0
18 Apr 2024
Diff-Control: A Stateful Diffusion-based Policy for Imitation Learning
Diff-Control: A Stateful Diffusion-based Policy for Imitation Learning
Xiao Liu
Yifan Zhou
F. Weigend
Shubham D. Sonawani
Shuhei Ikemoto
H. B. Amor
67
1
0
18 Apr 2024
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu
Taesung Park
Richard Zhang
Yang Zhou
Eli Shechtman
Feng Liu
Jia-Bin Huang
Difan Liu
SupR
143
12
0
18 Apr 2024
RoboDreamer: Learning Compositional World Models for Robot Imagination
RoboDreamer: Learning Compositional World Models for Robot Imagination
Siyuan Zhou
Yilun Du
Jiaben Chen
Yandong Li
Dit-Yan Yeung
Chuang Gan
VGenLM&Ro
150
45
0
18 Apr 2024
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Nupur Kumari
Grace Su
Richard Zhang
Taesung Park
Eli Shechtman
Jun-Yan Zhu
DiffM
92
5
0
18 Apr 2024
Reducing Bias in Pre-trained Models by Tuning while Penalizing Change
Reducing Bias in Pre-trained Models by Tuning while Penalizing Change
Niklas Penzel
Gideon Stein
Joachim Denzler
50
0
0
18 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image
  Segmentation
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
91
9
0
18 Apr 2024
Sketch-guided Image Inpainting with Partial Discrete Diffusion Process
Sketch-guided Image Inpainting with Partial Discrete Diffusion Process
Nakul Sharma
Aditay Tripathi
Anirban Chakraborty
Anand Mishra
DiffM
92
3
0
18 Apr 2024
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
Tianyi Liang
Jiangqi Liu
Sicheng Song
Shiqi Jiang
Yifei Huang
Changbo Wang
Chenhui Li
183
0
0
18 Apr 2024
Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Daniel Geng
Inbum Park
Andrew Owens
DiffM
150
16
0
17 Apr 2024
Towards Highly Realistic Artistic Style Transfer via Stable Diffusion
  with Step-aware and Layer-aware Prompt
Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt
Zhanjie Zhang
Quanwei Zhang
Huaizhong Lin
Wei Xing
Juncheng Mo
...
Guangyuan Li
Junsheng Luan
Lei Zhao
Dalong Zhang
Lixia Chen
DiffM
108
14
0
17 Apr 2024
Single-temporal Supervised Remote Change Detection for Domain
  Generalization
Single-temporal Supervised Remote Change Detection for Domain Generalization
Qiangang Du
Jinlong Peng
Xu Chen
Qingdong He
Liren He
Qiang Nie
Wenbing Zhu
Mingmin Chi
Yabiao Wang
Chengjie Wang
91
1
0
17 Apr 2024
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based
  Image Editing
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Kuo-Chin Lien
Misha Sra
Pradeep Sen
65
4
0
17 Apr 2024
Generating Human Interaction Motions in Scenes with Text Control
Generating Human Interaction Motions in Scenes with Text Control
Hongwei Yi
Justus Thies
Michael J. Black
Xue Bin Peng
Davis Rempe
VGenDiffM
100
47
0
16 Apr 2024
StyleCity: Large-Scale 3D Urban Scenes Stylization
StyleCity: Large-Scale 3D Urban Scenes Stylization
Yingshu Chen
Huajian Huang
Tuan-Anh Vu
Ka Chun Shum
Sai-Kit Yeung
87
0
0
16 Apr 2024
EucliDreamer: Fast and High-Quality Texturing for 3D Models with
  Depth-Conditioned Stable Diffusion
EucliDreamer: Fast and High-Quality Texturing for 3D Models with Depth-Conditioned Stable Diffusion
Cindy X. Le
Congrui Hetang
Chendi Lin
Ang Cao
Yihui He
68
0
0
16 Apr 2024
Salient Object-Aware Background Generation using Text-Guided Diffusion
  Models
Salient Object-Aware Background Generation using Text-Guided Diffusion Models
Amir Erfan Eshratifar
JOÃO-BRUNO Soares
K. Thadani
Shaunak Mishra
Mikhail Kuznetsov
Yueh-Ning Ku
P.De Juan
DiffM
117
4
0
15 Apr 2024
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion
  Models
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
MoMe
88
7
0
15 Apr 2024
Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers
Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
91
1
0
15 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse
  Controls to Any Diffusion Model
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffMVGen
159
28
0
15 Apr 2024
Photo-Realistic Image Restoration in the Wild with Controlled
  Vision-Language Models
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models
Ziwei Luo
Fredrik K. Gustafsson
Zheng Zhao
Jens Sjölund
Thomas B. Schön
VLM
94
14
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing,
  and Generation
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li Song
Wenjun Zhang
Zhiwu Huang
MLLM
74
0
0
15 Apr 2024
Text-Driven Diverse Facial Texture Generation via Progressive
  Latent-Space Refinement
Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement
Chi-Yin Wang
Junming Huang
Rong Zhang
Qi Wang
Haotian Yang
Haibin Huang
Chongyang Ma
Weiwei Xu
3DH
77
2
0
15 Apr 2024
Previous
123...343536...606162
Next