ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
WAS: Dataset and Methods for Artistic Text Segmentation
WAS: Dataset and Methods for Artistic Text Segmentation
Xudong Xie
Yuzhe Li
Yang Liu
Zhifei Zhang
Zhaowen Wang
Wei Xiong
Xiang Bai
DiffM
90
2
0
31 Jul 2024
From Attributes to Natural Language: A Survey and Foresight on
  Text-based Person Re-identification
From Attributes to Natural Language: A Survey and Foresight on Text-based Person Re-identification
Fanzhi Jiang
Su Yang
Mark W. Jones
Liumei Zhang
104
1
0
31 Jul 2024
Localized Gaussian Splatting Editing with Contextual Awareness
Localized Gaussian Splatting Editing with Contextual Awareness
Hanyuan Xiao
Yingshu Chen
Huajian Huang
Haolin Xiong
Jing Yang
P. Prasad
Yajie Zhao
3DGSDiffM
93
4
0
31 Jul 2024
Hyper-parameter tuning for text guided image editing
Hyper-parameter tuning for text guided image editing
Shiwen Zhang
DiffM
111
1
0
31 Jul 2024
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Zhichao Zhang
Xinyue Li
Wei Sun
Jun Jia
Xiongkuo Min
...
Puyi Wang
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Guangtao Zhai
EGVM
68
5
0
31 Jul 2024
Segment Anything for Videos: A Systematic Survey
Segment Anything for Videos: A Systematic Survey
Chunhui Zhang
Yawen Cui
Weilin Lin
Guanjie Huang
Yan Rong
Li Liu
Shiguang Shan
VLM
86
8
0
31 Jul 2024
Add-SD: Rational Generation without Manual Reference
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
69
0
0
30 Jul 2024
UniProcessor: A Text-induced Unified Low-level Image Processor
UniProcessor: A Text-induced Unified Low-level Image Processor
Huiyu Duan
Xiongkuo Min
Sijing Wu
Wei Shen
Guangtao Zhai
DiffM
75
12
0
30 Jul 2024
Diffusion Augmented Agents: A Framework for Efficient Exploration and
  Transfer Learning
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Norman Di Palo
Leonard Hasenclever
Jan Humplik
Arunkumar Byravan
74
3
0
30 Jul 2024
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
Aashish Rai
Srinath Sridhar
DiffM
75
4
0
30 Jul 2024
Learning Feature-Preserving Portrait Editing from Generated Pairs
Learning Feature-Preserving Portrait Editing from Generated Pairs
Bowei Chen
Tiancheng Zhi
Peihao Zhu
Shen Sang
Jing Liu
Linjie Luo
DiffM
96
0
0
29 Jul 2024
Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for
  Robust Semantic Perception
Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception
Konstantinos Tzevelekakis
Shutong Zhang
Luc Van Gool
Daniel Gehrig
3DV
90
0
0
29 Jul 2024
DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion
  Models
DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models
Jing Yang
Runping Xi
Yingxin Lai
Xun Lin
Zitong Yu
DiffM
61
1
0
29 Jul 2024
Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation"
Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation"
Daniel Gallo Fernández
Ruazvan-Andrei Matisan
Alejandro Monroy Muñoz
Janusz Partyka
57
0
0
29 Jul 2024
Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion
  Models via Retinex Theory
Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory
Xiaoyan Xing
Vincent Tao Hu
J. H. Metzen
Konrad Groh
Sezer Karaoglu
Theo Gevers
89
4
0
29 Jul 2024
VIMs: Virtual Immunohistochemistry Multiplex staining via Text-to-Stain
  Diffusion Trained on Uniplex Stains
VIMs: Virtual Immunohistochemistry Multiplex staining via Text-to-Stain Diffusion Trained on Uniplex Stains
Shikha Dubey
Yosep Chong
Beatrice Knudsen
Shireen Y. Elhabian
VLM
99
7
0
26 Jul 2024
SHIC: Shape-Image Correspondences with no Keypoint Supervision
SHIC: Shape-Image Correspondences with no Keypoint Supervision
Aleksandar Shtedritski
Christian Rupprecht
Andrea Vedaldi
3DPC3DH3DV
70
3
0
26 Jul 2024
LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial
  Control Enhancement
LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement
Rui Zhang
Yixiao Fang
Zhen-Zhong Lu
Pei Cheng
Zebiao Huang
Bin-Bin Fu
DiffMVGen
75
1
0
26 Jul 2024
Answerability Fields: Answerable Location Estimation via Diffusion
  Models
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
81
0
0
26 Jul 2024
Towards Localized Fine-Grained Control for Facial Expression Generation
Towards Localized Fine-Grained Control for Facial Expression Generation
Tuomas Varanka
Huai-Qian Khor
Yante Li
Mengting Wei
Hanwei Kung
N. Sebe
Guoying Zhao
99
4
0
25 Jul 2024
AttentionHand: Text-driven Controllable Hand Image Generation for 3D
  Hand Reconstruction in the Wild
AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild
Jun-Young Park
Kyeongbo Kong
Suk-ju Kang
DiffM
75
4
0
25 Jul 2024
DragText: Rethinking Text Embedding in Point-based Image Editing
DragText: Rethinking Text Embedding in Point-based Image Editing
Gayoon Choi
Taejin Jeong
Sujung Hong
Jaehoon Joo
Seong Jae Hwang
DiffM
84
4
0
25 Jul 2024
HumanVid: Demystifying Training Data for Camera-controllable Human Image
  Animation
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Zhenzhi Wang
Yixuan Li
Yanhong Zeng
Youqing Fang
Yuwei Guo
...
Jing Tan
Kai Chen
Tianfan Xue
Bo Dai
Dahua Lin
VGen3DH
162
23
0
24 Jul 2024
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging
  Conditions
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi
Pierluigi Zama Ramirez
Matteo Poggi
DiffMMQMDE
94
13
0
23 Jul 2024
Harmonizing Visual Text Comprehension and Generation
Harmonizing Visual Text Comprehension and Generation
Zhen Zhao
Jingqun Tang
Binghong Wu
Chunhui Lin
Shubo Wei
Hao Liu
Xin Tan
Zhizhong Zhang
Can Huang
Yuan Xie
VLM
107
26
0
23 Jul 2024
OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any
  Person
OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person
Ke Sun
Jian Cao
Qi Wang
Linrui Tian
Xindi Zhang
...
Bang Zhang
Liefeng Bo
Wenbo Zhou
Weiming Zhang
Daiheng Gao
DiffM
71
11
0
23 Jul 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion
  Consistency in Videos
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGenEGVM
69
15
0
23 Jul 2024
Stretching Each Dollar: Diffusion Training from Scratch on a
  Micro-Budget
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag
Xianghao Kong
Jingtao Li
Michael Spranger
Lingjuan Lyu
DiffM
90
11
0
22 Jul 2024
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
Zeyu Wang
Jingyu Lin
Yifei Qian
Yi Huang
Shicen Tian
...
Qu Yang
Lan Du
Cunjian Chen
Yufei Guo
Kejie Huang
DiffMVLM
87
3
0
22 Jul 2024
Text2Place: Affordance-aware Text Guided Human Placement
Text2Place: Affordance-aware Text Guided Human Placement
Rishubh Parihar
Harsh Gupta
VS Sachidanand
R. V. Babu
DiffM
86
5
0
22 Jul 2024
HoloDreamer: Holistic 3D Panoramic World Generation from Text
  Descriptions
HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions
Haiyang Zhou
Xinhua Cheng
Wangbo Yu
Yonghong Tian
Li-ming Yuan
3DGSDiffM
112
11
0
21 Jul 2024
Anchored Diffusion for Video Face Reenactment
Anchored Diffusion for Video Face Reenactment
I. Kligvasser
Regev Cohen
G. Leifman
Ehud Rivlin
Michael Elad
DiffMVGen
101
1
0
21 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
75
0
0
21 Jul 2024
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music
  Generation
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation
Yun-Han Lan
Wen-Yi Hsiao
Hao-Chung Cheng
Yi-Hsuan Yang
90
9
0
21 Jul 2024
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
Zheng Chong
Xiao Dong
Haoxiang Li
Shiyue Zhang
Wenqing Zhang
Xujie Zhang
Hanqing Zhao
D. Jiang
Xiaodan Liang
DiffM
136
24
0
21 Jul 2024
Deep Learning CT Image Restoration using System Blur and Noise Models
Deep Learning CT Image Restoration using System Blur and Noise Models
Yijie Yuan
G. Gang
J. W. Stayman
MedIm
79
0
0
20 Jul 2024
CoCoG-2: Controllable generation of visual stimuli for understanding
  human concept representation
CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation
Chen Wei
Jiachen Zou
Dietmar Heinke
Quanying Liu
68
0
0
20 Jul 2024
Diffusion Models as Data Mining Tools
Diffusion Models as Data Mining Tools
Ioannis Siglidis
Aleksander Holynski
Alexei A. Efros
Mathieu Aubry
Shiry Ginosar
DiffMMedIm
95
3
0
20 Jul 2024
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free
  Real-world Low-light Image Enhancement
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement
Yunlong Lin
Tian-Chun Ye
Sixiang Chen
Zhenqi Fu
Yingying Wang
Wenhao Chai
Zhaohu Xing
Lei Zhu
Xinghao Ding
DiffM
103
5
0
20 Jul 2024
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text
  Design and Generation
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation
Yuhang Bai
Zichuan Huang
Wenshuo Gao
Shuai Yang
Jiaying Liu
96
6
0
20 Jul 2024
Difflare: Removing Image Lens Flare with Latent Diffusion Model
Difflare: Removing Image Lens Flare with Latent Diffusion Model
Tianwen Zhou
Qihao Duan
Zitong Yu
55
1
0
20 Jul 2024
FedDM: Enhancing Communication Efficiency and Handling Data
  Heterogeneity in Federated Diffusion Models
FedDM: Enhancing Communication Efficiency and Handling Data Heterogeneity in Federated Diffusion Models
Jayneel Vora
Nader Bouacida
Aditya Krishnan
Prasant Mohapatra
FedML
94
2
0
20 Jul 2024
Controllable and Efficient Multi-Class Pathology Nuclei Data
  Augmentation using Text-Conditioned Diffusion Models
Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models
Hyunwoo Oh
Won-Ki Jeong
MedIm
87
5
0
19 Jul 2024
Not All Noises Are Created Equally:Diffusion Noise Selection and
  Optimization
Not All Noises Are Created Equally:Diffusion Noise Selection and Optimization
Zipeng Qi
Lichen Bai
Haoyi Xiong
Zeke Xie
DiffM
120
24
0
19 Jul 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized
  Generation
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu
Xi Chen
Zhongdao Wang
Hengshuang Zhao
Jiaya Jia
DiffM
94
3
0
18 Jul 2024
Training-free Composite Scene Generation for Layout-to-Image Synthesis
Training-free Composite Scene Generation for Layout-to-Image Synthesis
Jiaqi Liu
Tao Huang
Chang Xu
DiffM
78
7
0
18 Jul 2024
Multi-sentence Video Grounding for Long Video Generation
Multi-sentence Video Grounding for Long Video Generation
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Wenwu Zhu
DiffM
71
0
0
18 Jul 2024
SpaDiT: Diffusion Transformer for Spatial Gene Expression Prediction
  using scRNA-seq
SpaDiT: Diffusion Transformer for Spatial Gene Expression Prediction using scRNA-seq
Xiaoyu Li
Fangfang Zhu
Wenwen Min
MedIm
35
10
0
18 Jul 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-feng Li
Yueh-Feng Ku
Hong-Xuan Yen
Chi Liu
Yu-Lun Liu
Albert Y. C. Chen
Cheng-Hao Kuo
Min Sun
3DVVGen
95
4
0
17 Jul 2024
SMooDi: Stylized Motion Diffusion Model
SMooDi: Stylized Motion Diffusion Model
Lei Zhong
Yiming Xie
Varun Jampani
Deqing Sun
Huaizu Jiang
DiffM
113
18
0
17 Jul 2024
Previous
123...252627...606162
Next