ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for
  Fashion Image Editing
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
Alberto Baldrati
Davide Morelli
Giuseppe Cartella
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
58
57
0
04 Apr 2023
Revisiting the Evaluation of Image Synthesis with GANs
Revisiting the Evaluation of Image Synthesis with GANs
Mengping Yang
Ceyuan Yang
Yichi Zhang
Qingyan Bai
Yujun Shen
Bo Dai
EGVM
65
7
0
04 Apr 2023
viz2viz: Prompt-driven stylized visualization generation using a
  diffusion model
viz2viz: Prompt-driven stylized visualization generation using a diffusion model
Jiaqi Wu
John Joon Young Chung
Eytan Adar
DiffM
46
12
0
04 Apr 2023
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free
  Videos
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos
Yue Ma
Yin-Yin He
Xiaodong Cun
Xintao Wang
Siran Chen
Ying Shan
Xiu Li
Qifeng Chen
DiffMVGen
104
194
0
03 Apr 2023
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via
  Diffusion Models
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models
Yukang Cao
Yan-Pei Cao
Kai Han
Ying Shan
Kwan-Yee K. Wong
DiffM
100
145
0
03 Apr 2023
AUDIT: Audio Editing by Following Instructions with Latent Diffusion
  Models
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Yuancheng Wang
Zeqian Ju
Xuejiao Tan
Lei He
Zhizheng Wu
Jiang Bian
Sheng Zhao
DiffM
150
55
0
03 Apr 2023
A Closer Look at Parameter-Efficient Tuning in Diffusion Models
A Closer Look at Parameter-Efficient Tuning in Diffusion Models
Chendong Xiang
Fan Bao
Chongxuan Li
Hang Su
Jun Zhu
DiffM
52
16
0
31 Mar 2023
GlyphDraw: Seamlessly Rendering Text with Intricate Spatial Structures
  in Text-to-Image Generation
GlyphDraw: Seamlessly Rendering Text with Intricate Spatial Structures in Text-to-Image Generation
Jiancang Ma
Mingjun Zhao
Chen Chen
Ruichen Wang
Di Niu
H. Lu
Xiaodong Lin
DiffM
87
12
0
31 Mar 2023
Reference-based Image Composition with Sketch via Structure-aware
  Diffusion Model
Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Kangyeol Kim
S. Park
Junsoo Lee
Jaegul Choo
DiffM
58
13
0
31 Mar 2023
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging
  Face
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
149
913
0
30 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffMVLM
133
138
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yi Ding
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
69
8
0
30 Mar 2023
MDP: A Generalized Framework for Text-Guided Image Editing by
  Manipulating the Diffusion Path
MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
49
19
0
29 Mar 2023
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init
  Attention
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
Renrui Zhang
Jiaming Han
Chris Liu
Peng Gao
Aojun Zhou
Xiangfei Hu
Shilin Yan
Pan Lu
Hongsheng Li
Yu Qiao
MLLM
179
787
0
28 Mar 2023
Visual Chain-of-Thought Diffusion Models
Visual Chain-of-Thought Diffusion Models
William Harvey
Frank Wood
DiffMVLM
84
8
0
28 Mar 2023
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
136
56
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
131
198
0
27 Mar 2023
Anti-DreamBooth: Protecting users from personalized text-to-image
  synthesis
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
T. Le
Hao Phung
Thuan Hoang Nguyen
Quan Dao
Ngoc N. Tran
Anh Tran
109
100
0
27 Mar 2023
Training-free Content Injection using h-space in Diffusion Models
Training-free Content Injection using h-space in Diffusion Models
Jaeseok Jeong
Mingi Kwon
Youngjung Uh
DiffM
103
28
0
27 Mar 2023
Freestyle Layout-to-Image Synthesis
Freestyle Layout-to-Image Synthesis
Han Xue
Z. Huang
Qianru Sun
Li Song
Wenjun Zhang
DiffM
68
67
0
25 Mar 2023
End-to-End Diffusion Latent Optimization Improves Classifier Guidance
End-to-End Diffusion Latent Optimization Improves Classifier Guidance
Bram Wallace
Akash Gokul
Stefano Ermon
Nikhil Naik
193
80
0
23 Mar 2023
Ablating Concepts in Text-to-Image Diffusion Models
Ablating Concepts in Text-to-Image Diffusion Models
Nupur Kumari
Bin Zhang
Sheng-Yu Wang
Eli Shechtman
Richard Y. Zhang
Jun-Yan Zhu
VLM
75
201
0
23 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
115
228
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video
  Generators
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
85
581
0
23 Mar 2023
Pix2Video: Video Editing using Image Diffusion
Pix2Video: Video Editing using Image Diffusion
Duygu Ceylan
C. Huang
Niloy J. Mitra
DiffMVGen
131
261
0
22 Mar 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
130
182
0
21 Mar 2023
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D
  Object Detection
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Shihao Wang
Yingfei Liu
Tiancai Wang
Ying Li
Xiangyu Zhang
3DPC
132
211
0
21 Mar 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu
Sanghyuk Chun
Wonjae Kim
HeeJae Jun
Yoohoon Kang
Sangdoo Yun
DiffM
133
59
0
21 Mar 2023
Automatic Measures for Evaluating Generative Design Methods for
  Architects
Automatic Measures for Evaluating Generative Design Methods for Architects
Eric Yeh
Briland Hitaj
Vidyasagar Sadhu
Anirban Roy
Takuma Nakabayashi
Yoshito Tsuji
EGVM3DV
23
0
0
20 Mar 2023
Text2Tex: Text-driven Texture Synthesis via Diffusion Models
Text2Tex: Text-driven Texture Synthesis via Diffusion Models
Dave Zhenyu Chen
Yawar Siddiqui
Hsin-Ying Lee
Sergey Tulyakov
Matthias Nießner
DiffM
123
201
0
20 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion
  Models
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
95
120
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
151
286
0
20 Mar 2023
AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion
  Models
AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion Models
Yu Cao
Xiangqiao Meng
P. Y. Mok
Xueting Liu
Tong-Yee Lee
Ping Li
DiffM
62
14
0
20 Mar 2023
Object-Centric Slot Diffusion
Object-Centric Slot Diffusion
Jindong Jiang
Fei Deng
Gautam Singh
S. Ahn
DiffMBDLOCL
129
61
0
20 Mar 2023
SKED: Sketch-guided Text-based 3D Editing
SKED: Sketch-guided Text-based 3D Editing
Aryan Mikaeili
Or Perel
Mehdi Safaee
Daniel Cohen-Or
Ali Mahdavi-Amiri
DiffM
109
67
0
19 Mar 2023
A Recipe for Watermarking Diffusion Models
A Recipe for Watermarking Diffusion Models
Yunqing Zhao
Tianyu Pang
Chao Du
Xiao Yang
Ngai-Man Cheung
Min Lin
WIGM
103
124
0
17 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
129
21
0
17 Mar 2023
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
Jiwen Yu
Yinhuai Wang
Chen Zhao
Guohao Li
Jian Zhang
DiffM
81
190
0
17 Mar 2023
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
DiffM
125
78
0
17 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
93
116
0
16 Mar 2023
Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in
  Challenging Domains
Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains
Zhenzhen Weng
Laura Bravo Sánchez
Serena Yeung-Levy
DiffM
42
0
0
16 Mar 2023
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
Chenyang Qi
Xiaodong Cun
Yong Zhang
Chenyang Lei
Xintao Wang
Ying Shan
Qifeng Chen
VGen
117
356
0
16 Mar 2023
Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D
  Generation
Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation
Junyoung Seo
Wooseok Jang
Minseop Kwak
Ines Hyeonsu Kim
Jaehoon Ko
Junho Kim
Jin-Hwa Kim
Jiyoung Lee
Seung Wook Kim
DiffM
82
138
0
14 Mar 2023
Prompting AI Art: An Investigation into the Creative Skill of Prompt
  Engineering
Prompting AI Art: An Investigation into the Creative Skill of Prompt Engineering
J. Oppenlaender
Rhema Linder
Johanna M. Silvennoinen
71
86
0
13 Mar 2023
PARASOL: Parametric Style Control for Diffusion Image Synthesis
PARASOL: Parametric Style Control for Diffusion Image Synthesis
Gemma Canet Tarrés
Dan Ruta
Tu Bui
John Collomosse
DiffM
83
7
0
11 Mar 2023
KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF
  Grasp Synthesis on RGB-D input
KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input
Yiye Chen
Ruinian Xu
Yunzhi Lin
Hongyi Chen
Patricio A. Vela
3DPC
75
4
0
09 Mar 2023
Learning Stationary Markov Processes with Contrastive Adjustment
Learning Stationary Markov Processes with Contrastive Adjustment
L. Bergenstråhle
J. Lagergren
J. Lundeberg
BDL
28
0
0
09 Mar 2023
Identification of Systematic Errors of Image Classifiers on Rare
  Subgroups
Identification of Systematic Errors of Image Classifiers on Rare Subgroups
J. H. Metzen
Robin Hutmacher
N. G. Hua
Valentyn Boreiko
Dan Zhang
AAMLVLM
99
19
0
09 Mar 2023
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation
  Models
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Chenfei Wu
Sheng-Kai Yin
Weizhen Qi
Xiaodong Wang
Zecheng Tang
Nan Duan
MLLMLRM
144
649
0
08 Mar 2023
Exploring Efficient-Tuned Learning Audio Representation Method from
  BriVL
Exploring Efficient-Tuned Learning Audio Representation Method from BriVL
Sen Fang
Yang Wu
Bowen Gao
Jingwen Cai
T. Teoh
DiffM
46
1
0
08 Mar 2023
Previous
123...606162
Next