ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
Boosting GUI Prototyping with Diffusion Models
Boosting GUI Prototyping with Diffusion Models
Jialiang Wei
A. Courbis
Thomas Lambolais
Binbin Xu
P. Bernard
Gérard Dray
DiffM
74
22
0
09 Jun 2023
GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields
GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields
Barbara Roessle
Norman Muller
Lorenzo Porzi
Samuel Rota Buló
Peter Kontschieder
Matthias Nießner
77
19
0
09 Jun 2023
Grounded Text-to-Image Synthesis with Attention Refocusing
Grounded Text-to-Image Synthesis with Attention Refocusing
Quynh Phung
Songwei Ge
Jia-Bin Huang
DiffM
117
113
0
08 Jun 2023
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
Yuseung Lee
Kunho Kim
Hyunjin Kim
Minhyuk Sung
DiffM
127
67
0
08 Jun 2023
ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image
  Collections
ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections
Chunfeng Yao
Amit Raj
Wei-Chih Hung
Yuanzhen Li
Michael Rubinstein
Ming-Hsuan Yang
Varun Jampani
DiffM
69
20
0
07 Jun 2023
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data
  Generation
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation
Kai Chen
Enze Xie
Zhe Chen
Yibo Wang
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
DiffM
140
26
0
07 Jun 2023
Matte Anything: Interactive Natural Image Matting with Segment Anything
  Models
Matte Anything: Interactive Natural Image Matting with Segment Anything Models
J. Yao
Xinggang Wang
Lang Ye
Wenyu Liu
94
42
0
07 Jun 2023
On the Design Fundamentals of Diffusion Models: A Survey
On the Design Fundamentals of Diffusion Models: A Survey
Ziyi Chang
George Alex Koulieris
Hyung Jin Chang
Hubert P. H. Shum
DiffM
183
56
0
07 Jun 2023
AI Art Curation: Re-imagining the city of Helsinki in occasion of its
  Biennial
AI Art Curation: Re-imagining the city of Helsinki in occasion of its Biennial
Ludovica Schaerf
Pepe Ballesteros
Valentine Bernasconi
Iacopo Neri
Dario Negueruela del Castillo
HAI
36
1
0
06 Jun 2023
Towards Visual Foundational Models of Physical Scenes
Towards Visual Foundational Models of Physical Scenes
Chethan Parameshwara
Alessandro Achille
Matthew Trager
Xiaolong Li
Jiawei Mo
...
A. Swaminathan
C. Taylor
D. Venkatraman
Xiaohan Fei
Stefano Soatto
DiffM
60
4
0
06 Jun 2023
DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model
  Given Sparse Views
DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views
Paul D. Yoo
Jiaxian Guo
Yutaka Matsuo
S. Gu
110
24
0
06 Jun 2023
HeadSculpt: Crafting 3D Head Avatars with Text
HeadSculpt: Crafting 3D Head Avatars with Text
Xiaoping Han
Yukang Cao
Kai Han
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
Kwan-Yee K. Wong
DiffM
73
47
0
05 Jun 2023
Stable Diffusion is Unstable
Stable Diffusion is Unstable
Chengbin Du
Yanxi Li
Zhongwei Qiu
Chang Xu
DiffM
93
18
0
05 Jun 2023
Efficient Text-Guided 3D-Aware Portrait Generation with Score
  Distillation Sampling on Distribution
Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Yiji Cheng
Fei Yin
Xiaoke Huang
Xintong Yu
Jiaxiang Liu
Shi Feng
Yujiu Yang
Yansong Tang
DiffM
76
5
0
03 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGenDiffM
121
341
0
03 Jun 2023
Conditional Generation from Unconditional Diffusion Models using
  Denoiser Representations
Conditional Generation from Unconditional Diffusion Models using Denoiser Representations
Alexandros Graikos
Srikar Yellapragada
Dimitris Samaras
DiffMAI4CE
72
6
0
02 Jun 2023
Probabilistic Adaptation of Text-to-Video Models
Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang
Yilun Du
Bo Dai
Dale Schuurmans
J. Tenenbaum
Pieter Abbeel
VGenDiffM
137
26
0
02 Jun 2023
Video Colorization with Pre-trained Text-to-Image Diffusion Models
Video Colorization with Pre-trained Text-to-Image Diffusion Models
Hanyuan Liu
M. Xie
Jinbo Xing
Chengze Li
T. Wong
VLMDiffM
104
13
0
02 Jun 2023
Privacy Distillation: Reducing Re-identification Risk of Multimodal
  Diffusion Models
Privacy Distillation: Reducing Re-identification Risk of Multimodal Diffusion Models
Virginia Fernandez
Pedro Sanchez
W. H. Pinaya
Grzegorz Jacenków
Sotirios A. Tsaftaris
Jorge Cardoso
82
19
0
02 Jun 2023
DeepfakeArt Challenge: A Benchmark Dataset for Generative AI Art Forgery
  and Data Poisoning Detection
DeepfakeArt Challenge: A Benchmark Dataset for Generative AI Art Forgery and Data Poisoning Detection
Hossein Aboutalebi
Daniel Mao
Rongqi Fan
Carol Xu
Chris He
Alexander Wong
AAML
71
8
0
02 Jun 2023
StyleGAN knows Normal, Depth, Albedo, and More
StyleGAN knows Normal, Depth, Albedo, and More
Anand Bhattad
Daniel McKee
Derek Hoiem
David A. Forsyth
GAN
67
36
0
01 Jun 2023
Diffusion Self-Guidance for Controllable Image Generation
Diffusion Self-Guidance for Controllable Image Generation
Dave Epstein
Allan Jabri
Ben Poole
Alexei A. Efros
Aleksander Holynski
109
266
0
01 Jun 2023
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual
  Representation Learners
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
Yonglong Tian
Lijie Fan
Phillip Isola
Huiwen Chang
Dilip Krishnan
VLMDiffM
145
153
0
01 Jun 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion
  Models
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffMVLM
154
44
0
01 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image
  Generation
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
88
10
0
01 Jun 2023
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image
  Generation
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation
Minghui Hu
Jianbin Zheng
Daqing Liu
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
78
9
0
01 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and
  Structural Guidance
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGenDiffM
82
93
0
01 Jun 2023
Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and
  Compatible Triggers
Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers
Ke Xu
Hongrui Chen
Zihao Zhu
Li Liu
Baoyuan Wu
DiffM
126
11
0
01 Jun 2023
Control4D: Efficient 4D Portrait Editing with Text
Control4D: Efficient 4D Portrait Editing with Text
Ruizhi Shao
Jingxiang Sun
Cheng Peng
Zerong Zheng
Boyao Zhou
Hongwen Zhang
Yebin Liu
DiffM
116
25
0
31 May 2023
A Geometric Perspective on Diffusion Models
A Geometric Perspective on Diffusion Models
Defang Chen
Zhenyu Zhou
Jianhan Mei
Chunhua Shen
Chun-Yen Chen
C. Wang
DiffM
78
20
0
31 May 2023
PaintSeg: Training-free Segmentation via Painting
PaintSeg: Training-free Segmentation via Painting
Xiang Li
Chung-Ching Lin
Yinpeng Chen
Zicheng Liu
Jinglu Wang
Bhiksha Raj
115
5
0
30 May 2023
Cones 2: Customizable Image Synthesis with Multiple Subjects
Cones 2: Customizable Image Synthesis with Multiple Subjects
Zhiheng Liu
Yifei Zhang
Yujun Shen
Kecheng Zheng
Kai Zhu
Ruili Feng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
104
81
0
30 May 2023
Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video
  Translation Using Conditional Image Diffusion Models
Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models
Ernie Chu
Shuohao Lin
Jun-Cheng Chen
DiffM
64
21
0
30 May 2023
LANCE: Stress-testing Visual Models by Generating Language-guided
  Counterfactual Images
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
Viraj Prabhu
Sriram Yenamandra
Prithvijit Chattopadhyay
Judy Hoffman
104
42
0
30 May 2023
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity
  3D Avatar Generation
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Chi Zhang
Yiwen Chen
Yijun Fu
Zheng-Yang Zhou
YU Gang
Billzb Wang
Bin-Bin Fu
Tao Chen
Guosheng Lin
Chunhua Shen
DiffM
102
29
0
30 May 2023
GPT4Tools: Teaching Large Language Model to Use Tools via
  Self-instruction
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Rui Yang
Lin Song
Yanwei Li
Sijie Zhao
Yixiao Ge
Xiu Li
Ying Shan
SyDaMLLM
88
227
0
30 May 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang
Jinbo Xing
Eric Lo
Jiaya Jia
99
35
0
30 May 2023
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for
  Text-driven Video Editing
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing
Nazmul Karim
Umar Khalid
M. Joneidi
Chen Chen
Nazanin Rahnavard
DiffMVGen
63
5
0
30 May 2023
Controllable Text-to-Image Generation with GPT-4
Controllable Text-to-Image Generation with GPT-4
Tianjun Zhang
Yi Zhang
Vibhav Vineet
Neel Joshi
Xin Eric Wang
DiffM
150
44
0
29 May 2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Zeyue Xue
Guanglu Song
Qiushan Guo
Boxiao Liu
Zhuofan Zong
Yu Liu
Ping Luo
DiffM
173
137
0
29 May 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept
  Customization of Diffusion Models
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Yuchao Gu
Xintao Wang
Jay Zhangjie Wu
Yujun Shi
Yunpeng Chen
...
Shuning Chang
Wei Wu
Yixiao Ge
Ying Shan
Mike Zheng Shou
DiffM
144
177
0
29 May 2023
Photoswap: Personalized Subject Swapping in Images
Photoswap: Personalized Subject Swapping in Images
Jing Gu
Yilin Wang
Nanxuan Zhao
Tsu-Jui Fu
Wei Xiong
...
Zhifei Zhang
He Zhang
Jianming Zhang
Hyun-Sun Jung
Xin Eric Wang
DiffM
99
43
0
29 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal
  Co-Denoising
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGenDiffM
117
93
0
29 May 2023
GlyphControl: Glyph Conditional Control for Visual Text Generation
GlyphControl: Glyph Conditional Control for Visual Text Generation
Yukang Yang
Dongnan Gui
Yuhui Yuan
Weicong Liang
Haisong Ding
Hang-Rui Hu
Kai Chen
DiffM
90
85
0
29 May 2023
CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion
  Models
CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models
Zhongxi Chen
Ke Sun
Xianming Lin
Rongrong Ji
DiffM
97
31
0
29 May 2023
SPAC-Net: Synthetic Pose-aware Animal ControlNet for Enhanced Pose
  Estimation
SPAC-Net: Synthetic Pose-aware Animal ControlNet for Enhanced Pose Estimation
Le Jiang
Sarah Ostadabbas
65
7
0
29 May 2023
Efficient Storage of Fine-Tuned Models via Low-Rank Approximation of
  Weight Residuals
Efficient Storage of Fine-Tuned Models via Low-Rank Approximation of Weight Residuals
Simo Ryu
S. Seo
Jaejun Yoo
87
8
0
28 May 2023
Text-to-image Editing by Image Information Removal
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
97
13
0
27 May 2023
Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain
  Activities
Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities
Jingyuan Sun
Mingxiao Li
Zijiao Chen
Yunhao Zhang
Shaonan Wang
Marie-Francine Moens
DiffM
111
33
0
26 May 2023
ControlVideo: Conditional Control for One-shot Text-driven Video Editing
  and Beyond
ControlVideo: Conditional Control for One-shot Text-driven Video Editing and Beyond
Min Zhao
Rongzheng Wang
Fan Bao
Chongxuan Li
Jun Zhu
VGenDiffM
35
5
0
26 May 2023
Previous
123...575859606162
Next