ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image
  Models
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Wei Wei
Tingbo Hou
Yael Pritch
Neal Wadhwa
Michael Rubinstein
Kfir Aberman
DiffM
101
183
0
13 Jul 2023
Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic
  Data
Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data
Miroslav Purkrábek
Jivrí Matas
73
2
0
13 Jul 2023
My3DGen: A Scalable Personalized 3D Generative Model
My3DGen: A Scalable Personalized 3D Generative Model
Luchao Qi
Jiaye Wu
Annie N. Wang
Sheng-Yu Wang
Roni Sengupta
3DH
91
5
0
11 Jul 2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Feng Li
Hao Zhang
Pei Sun
Xueyan Zou
Siyi Liu
Jianwei Yang
Chun-yue Li
Lei Zhang
Jianfeng Gao
VLM
112
177
0
10 Jul 2023
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image
  Alignment with Iterative VQA Feedback
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Jaskirat Singh
Liang Zheng
107
19
0
10 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models
  without Specific Tuning
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
152
882
0
10 Jul 2023
FreeDrag: Feature Dragging for Reliable Point-based Image Editing
FreeDrag: Feature Dragging for Reliable Point-based Image Editing
Pengyang Ling
Lin Chen
Pan Zhang
H. Chen
Yi Jin
Jinjin Zheng
DiffM
108
16
0
10 Jul 2023
DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer
DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer
Dan Ruta
Gemma Canet Tarrés
Andrew Gilbert
Eli Shechtman
Nicholas I. Kolkin
John Collomosse
DiffM
96
5
0
09 Jul 2023
Blocks2World: Controlling Realistic Scenes with Editable Primitives
Blocks2World: Controlling Realistic Scenes with Editable Primitives
Vaibhav Vavilala
Seemandhar Jain
R. Vasanth
Anand Bhattad
David A. Forsyth
VGen
89
4
0
07 Jul 2023
Text-Guided Synthesis of Eulerian Cinemagraphs
Text-Guided Synthesis of Eulerian Cinemagraphs
Aniruddha Mahapatra
Aliaksandr Siarohin
Hsin-Ying Lee
Sergey Tulyakov
Sitong Su
DiffMVGen
92
21
0
06 Jul 2023
A Critical Look at the Current Usage of Foundation Model for Dense
  Recognition Task
A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task
Shiqi Yang
Atsushi Hashimoto
Yoshitaka Ushiku
DiffMVLM
76
1
0
06 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Zhang
DiffM
125
154
0
05 Jul 2023
RADiff: Controllable Diffusion Models for Radio Astronomical Maps
  Generation
RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation
Renato Sortino
T. Cecconello
A. DeMarco
G. Fiameni
Andrea Pilzer
...
E. Sciacca
A. Ingallinera
C. Bordiu
F. Bufano
C. Spampinato
DiffM
57
2
0
05 Jul 2023
Towards Open Federated Learning Platforms: Survey and Vision from
  Technical and Legal Perspectives
Towards Open Federated Learning Platforms: Survey and Vision from Technical and Legal Perspectives
Moming Duan
Qinbin Li
Linshan Jiang
Bingsheng He
FedML
105
5
0
05 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
302
2,457
0
04 Jul 2023
Text + Sketch: Image Compression at Ultra Low Rates
Text + Sketch: Image Compression at Ultra Low Rates
Eric Lei
Yiugit Berkay Uslu
Hamed Hassani
Shirin Saeedi Bidokhti
DiffM
75
50
0
04 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
62
21
0
04 Jul 2023
MVDiffusion: Enabling Holistic Multi-view Image Generation with
  Correspondence-Aware Diffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion
Shitao Tang
Fuyang Zhang
Jiacheng Chen
Peng Wang
Yasutaka Furukawa
131
156
0
03 Jul 2023
DifFSS: Diffusion Model for Few-Shot Semantic Segmentation
DifFSS: Diffusion Model for Few-Shot Semantic Segmentation
Weimin Tan
Siyuan Chen
Bo Yan
DiffM
85
25
0
03 Jul 2023
DreamIdentity: Improved Editability for Efficient Face-identity
  Preserved Image Generation
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation
Zhuowei Chen
Shancheng Fang
Wei Liu
Qian He
Mengqi Huang
Yongdong Zhang
Zhendong Mao
DiffM
125
24
0
01 Jul 2023
AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI
  Generated Images: from the Perspectives of Quality, Authenticity and
  Correspondence
AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence
Jiarui Wang
Huiyu Duan
Jing Liu
S. Chen
Xiongkuo Min
Guangtao Zhai
EGVM
96
60
0
01 Jul 2023
DisCo: Disentangled Control for Realistic Human Dance Generation
DisCo: Disentangled Control for Realistic Human Dance Generation
Tan Wang
Linjie Li
Kevin Qinghong Lin
Yuanhao Zhai
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
VGen
141
89
0
30 Jun 2023
Counting Guidance for High Fidelity Text-to-Image Synthesis
Counting Guidance for High Fidelity Text-to-Image Synthesis
Wonjune Kang
Kevin Galim
H. Koo
Nam Ik Cho
DiffM
122
10
0
30 Jun 2023
Generate Anything Anywhere in Any Scene
Generate Anything Anywhere in Any Scene
Yuheng Li
Haotian Liu
Yangming Wen
Yong Jae Lee
DiffM
134
12
0
29 Jun 2023
Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion
  Models
Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models
Zeqi Gu
Abe Davis
DiffM
54
2
0
29 Jun 2023
ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion
  Models
ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models
Weihao Cheng
Yan-Pei Cao
Ying Shan
DiffM
113
6
0
29 Jun 2023
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text
  Aligned Latent Representation
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Zibo Zhao
Wen Liu
Xin Chen
Xi Zeng
Rui Wang
Pei Cheng
Bin-Bin Fu
Tao Chen
Gang Yu
Shenghua Gao
DiffM
155
107
0
29 Jun 2023
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape
  Optimization
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization
Minghua Liu
Chao Xu
Haian Jin
Ling-Hao Chen
T. MukundVarma
Zexiang Xu
Hao Su
140
469
0
29 Jun 2023
DiffComplete: Diffusion-based Generative 3D Shape Completion
DiffComplete: Diffusion-based Generative 3D Shape Completion
Ruihang Chu
Enze Xie
Shentong Mo
Zhenguo Li
Matthias Nießner
Chi-Wing Fu
Jiaya Jia
DiffM
73
23
0
28 Jun 2023
Next Steps for Human-Centered Generative AI: A Technical Perspective
Next Steps for Human-Centered Generative AI: A Technical Perspective
Xiang Ánthony' Chen
Jeff Burke
Andrea Colaço
Matthew K. Hong
Jennifer Jacobs
...
Dingzeyu Li
Nanyun Peng
Karl D. D. Willis
Chien-Sheng Wu
Bolei Zhou
LLMAG
91
35
0
27 Jun 2023
A-STAR: Test-time Attention Segregation and Retention for Text-to-image
  Synthesis
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLMDiffM
39
51
0
26 Jun 2023
Text-Anchored Score Composition: Tackling Condition Misalignment in
  Text-to-Image Diffusion Models
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang
Guibao Shen
Wenhang Ge
Guangyong Chen
Yijun Li
Yingke Chen
DiffM
78
4
0
26 Jun 2023
Zero-shot spatial layout conditioning for text-to-image diffusion models
Zero-shot spatial layout conditioning for text-to-image diffusion models
Guillaume Couairon
Marlene Careil
Matthieu Cord
Stéphane Lathuilière
Jakob Verbeek
VLM
77
65
0
23 Jun 2023
DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch
  Diffusion in Histopathology
DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology
Marco Aversa
Gabriel Nobis
Miriam Hagele
Kai Standvoss
Mihaela Chirica
...
D. Ivanova
Wojciech Samek
Frederick Klauschen
B. Sanguinetti
Luis Oala
MedIm
100
23
0
23 Jun 2023
Continuous Layout Editing of Single Images with Diffusion Models
Continuous Layout Editing of Single Images with Diffusion Models
Zhiyuan Zhang
Zhitong Huang
J. Liao
DiffM
63
10
0
22 Jun 2023
Eliminating Lipschitz Singularities in Diffusion Models
Eliminating Lipschitz Singularities in Diffusion Models
Zhantao Yang
Ruili Feng
Han Zhang
Yujun Shen
Kaixuan Zhu
...
Yifei Zhang
Yu Liu
Deli Zhao
Jingren Zhou
Fan Cheng
57
12
0
20 Jun 2023
Image Harmonization with Diffusion Model
Image Harmonization with Diffusion Model
Jia-jin Li
Jian Wang
Chen Wang
Jinjun Xiong
DiffM
62
3
0
17 Jun 2023
AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation
AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation
Yifei Zeng
Yuanxun Lu
Xinya Ji
Yao Yao
Hao Zhu
Xun Cao
DiffM
74
30
0
16 Jun 2023
Evaluating the Robustness of Text-to-image Diffusion Models against
  Real-world Attacks
Evaluating the Robustness of Text-to-image Diffusion Models against Real-world Attacks
Hongcheng Gao
Hao Zhang
Yinpeng Dong
Zhijie Deng
AAML
109
23
0
16 Jun 2023
Linguistic Binding in Diffusion Models: Enhancing Attribute
  Correspondence through Attention Map Alignment
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin
Eran Hirsch
Daniel Glickman
Shauli Ravfogel
Yoav Goldberg
Gal Chechik
DiffM
115
108
0
15 Jun 2023
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing
Paul Couairon
Clément Rambour
Jean-Emmanuel Haugeard
Nicolas Thome
DiffMVGen
76
30
0
14 Jun 2023
Diffusion in Diffusion: Cyclic One-Way Diffusion for
  Text-Vision-Conditioned Generation
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
Ruoyu Wang
Yongqi Yang
Zhihao Qian
Ye Zhu
Yuehua Wu
DiffM
97
14
0
14 Jun 2023
ZeroForge: Feedforward Text-to-Shape Without 3D Supervision
ZeroForge: Feedforward Text-to-Shape Without 3D Supervision
Kelly O. Marshall
Minh Pham
Ameya Joshi
Anushrut Jignasu
Aditya Balu
Adarsh Krishnamurthy
A. Hegde
CLIP
60
3
0
14 Jun 2023
Generating Images with 3D Annotations Using Diffusion Models
Generating Images with 3D Annotations Using Diffusion Models
Wufei Ma
Qihao Liu
Jiahao Wang
Angtian Wang
Xiaoding Yuan
...
Ruxiao Duan
Yongrui Qi
Adam Kortylewski
Yaoyao Liu
Alan Yuille
DiffM
86
5
0
13 Jun 2023
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGenDiffM
102
221
0
13 Jun 2023
VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON
VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON
Haoping Bai
Shancong Mou
Tatiana Likhomanenko
R. G. Cinbis
Oncel Tuzel
Ping Huang
Jiulong Shan
Jianjun Shi
Mengsi Cao
VLM
73
26
0
13 Jun 2023
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing
  with Pre-Trained Diffusion Model
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model
Xinyu Zhang
Jiaxian Guo
Paul D. Yoo
Yutaka Matsuo
Yusuke Iwasawa
DiffM
108
22
0
13 Jun 2023
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Zeju Qiu
Wei-yu Liu
Haiwen Feng
Yuxuan Xue
Yao Feng
Zhen Liu
Dan Zhang
Adrian Weller
Bernhard Schölkopf
DiffM
126
158
0
12 Jun 2023
Scalable 3D Captioning with Pretrained Models
Scalable 3D Captioning with Pretrained Models
Tiange Luo
C. Rockwell
Honglak Lee
Justin Johnson
116
160
0
12 Jun 2023
MovieFactory: Automatic Movie Creation from Text using Large Generative
  Models for Language and Images
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
Sitong Su
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
VGenDiffM
90
40
0
12 Jun 2023
Previous
123...565758...606162
Next