Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
138
201
0
07 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM, VGen
75
98
0
07 Dec 2023
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Jiayi Guo
Xingqian Xu
Yifan Pu
Zanlin Ni
Chaofei Wang
Manushree Vasu
Shiji Song
Gao Huang
Humphrey Shi
DiffM
76
32
0
07 Dec 2023
DemoCaricature: Democratising Caricature Generation with a Rough Sketch
Dar-Yen Chen
A. Bhunia
Subhadeep Koley
Aneeshan Sain
Pinaki Nath Chowdhury
Yi-Zhe Song
94
8
0
07 Dec 2023
Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images
Yiqun Zhang
Zhen Qin
Yang Liu
Dylan Campbell
65
2
0
07 Dec 2023
Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors
Lihe Ding
Shaocong Dong
Zhanpeng Huang
Zibin Wang
Yiyuan Zhang
Kaixiong Gong
Dan Xu
Tianfan Xue
66
17
0
07 Dec 2023
Style Transfer to Calvin and Hobbes comics using Stable Diffusion
Sloke Shrestha
Sundar Sripada
Asvin Venkataramanan
DiffM
52
1
0
07 Dec 2023
Stable Diffusion for Data Augmentation in COCO and Weed Datasets
Boyang Deng
69
2
0
07 Dec 2023
AVID: Any-Length Video Inpainting with Diffusion Model
Zhixing Zhang
Bichen Wu
Xiaoyan Wang
Yaqiao Luo
Luxin Zhang
Yinan Zhao
Peter Vajda
Dimitris N. Metaxas
Licheng Yu
VGen, DiffM
128
42
0
06 Dec 2023
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Zhouxia Wang
Ziyang Yuan
Xintao Wang
Tianshui Chen
Menghan Xia
Ping Luo
Ying Shan
DiffM, VGen
161
230
0
06 Dec 2023
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
105
23
0
06 Dec 2023
DreamComposer: Controllable 3D Object Generation via Multi-View Conditions
Yu-nuo Yang
Yukun Huang
Xiaoyang Wu
Yuanchen Guo
Song-Hai Zhang
Hengshuang Zhao
Tong He
Xihui Liu
DiffM
86
12
0
06 Dec 2023
DiffusionSat: A Generative Foundation Model for Satellite Imagery
Samar Khanna
Patrick Liu
Linqi Zhou
Chenlin Meng
Robin Rombach
Marshall Burke
David B. Lobell
Stefano Ermon
87
65
0
06 Dec 2023
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting
Junhao Zhuang
Yanhong Zeng
Wenran Liu
Chun Yuan
Kai Chen
DiffM
141
79
0
06 Dec 2023
Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska
Animesh Sinha
Abhimanyu Dubey
Dhruv Mahajan
Vignesh Ramanathan
Filip Radenovic
DiffM
53
14
0
06 Dec 2023
Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention
Jianjin Xu
Saman Motamed
Praneetha Vaddamanu
C. Wu
Christian Haene
Jean-Charles Bazin
Fernando de la Torre
76
14
0
06 Dec 2023
FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
Junhyuk So
Jungwon Lee
Eunhyeok Park
DiffM
85
11
0
06 Dec 2023
AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation
Xinzhou Wang
Yikai Wang
Junliang Ye
Zhengyi Wang
Gang Hua
Pengkun Liu
Ling Wang
Kai Sun
Xintong Wang
Bin He
DiffM
101
19
0
06 Dec 2023
Kandinsky 3.0 Technical Report
V.Ya. Arkhipkin
Andrei Filatov
Viacheslav Vasilev
Anastasia Maltseva
Said Azizov
Igor Pavlov
Julia Agafonova
Andrey Kuznetsov
Denis Dimitrov
DiffM
110
13
0
06 Dec 2023
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Jiwen Yu
Xiaodong Cun
Chenyang Qi
Yong Zhang
Xintao Wang
Ying Shan
Jian Zhang
VGen
80
14
0
06 Dec 2023
Molecule Joint Auto-Encoding: Trajectory Pretraining with 2D and 3D Diffusion
Weitao Du
Jiujiu Chen
Xuecang Zhang
Zhiming Ma
Shengchao Liu
DiffM
82
9
0
06 Dec 2023
FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability
Linze Li
Sunqi Fan
Hengjun Pu
Z. Bing
Yao Tang
Tianzhu Ye
Tong Yang
Liangyu Chen
Jiajun Liang
VGen, DiffM
48
0
0
06 Dec 2023
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing
Shao-Yu Chang
Hwann-Tzong Chen
Tyng-Luh Liu
DiffM, VGen
102
3
0
05 Dec 2023
DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models
Shaoan Xie
Yang Zhao
Zhisheng Xiao
Kelvin C. K. Chan
Yandong Li
Yanwu Xu
Kun Zhang
Tingbo Hou
DiffM
98
28
0
05 Dec 2023
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet
Soon Yau Cheong
Armin Mustafa
Andrew Gilbert
DiffM
65
5
0
05 Dec 2023
Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing
Yushi Lan
Feitong Tan
Di Qiu
Qiangeng Xu
Kyle Genova
...
S. Fanello
Rohit Pandey
Thomas Funkhouser
Chen Change Loy
Yinda Zhang
3DGS
84
18
0
05 Dec 2023
LooseControl: Lifting ControlNet for Generalized Depth Conditioning
Shariq Farooq Bhat
Niloy J. Mitra
Peter Wonka
AI4CE, DiffM
78
40
0
05 Dec 2023
Alchemist: Parametric Control of Material Properties with Diffusion Models
Prafull Sharma
Varun Jampani
Yuanzhen Li
Xuhui Jia
Dmitry Lagun
Frédo Durand
William T. Freeman
Mark J. Matthews
DiffM
130
26
0
05 Dec 2023
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
Yuru Jia
Lukas Hoyer
Shengyu Huang
Tianfu Wang
Luc Van Gool
Konrad Schindler
Anton Obukhov
DiffM
123
24
0
05 Dec 2023
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
Jiachen Lu
Ze Huang
Zeyu Yang
Jiahui Zhang
Li Zhang
VGen
94
46
0
05 Dec 2023
LivePhoto: Real Image Animation with Text-guided Motion Control
Xi Chen
Zhiheng Liu
Mengting Chen
Yutong Feng
Yu Liu
Yujun Shen
Hengshuang Zhao
VGen, DiffM
85
33
0
05 Dec 2023
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
120
26
0
05 Dec 2023
Fine-grained Controllable Video Generation via Object Appearance and Context
Hsin-Ping Huang
Yu-Chuan Su
Deqing Sun
Lu Jiang
Xuhui Jia
Yukun Zhu
Ming-Hsuan Yang
DiffM, VGen
73
15
0
05 Dec 2023
Diversified in-domain synthesis with efficient fine-tuning for few-shot classification
Victor G. Turrisi da Costa
Nicola Dall’Asen
Yiming Wang
N. Sebe
Elisa Ricci
91
5
0
05 Dec 2023
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
Fengyuan Shi
Jiaxi Gu
Hang Xu
Songcen Xu
Wei Zhang
Limin Wang
VGen, DiffM
70
14
0
05 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
153
203
0
05 Dec 2023
FaceStudio: Put Your Face Everywhere in Seconds
Yuxuan Yan
C. Zhang
Rui Wang
Yichao Zhou
Gege Zhang
Pei Cheng
Gang Yu
Bin-Bin Fu
DiffM
70
41
0
05 Dec 2023
Diffusion Noise Feature: Accurate and Fast Generated Image Detection
Yichi Zhang
Xiaogang Xu
DiffM
91
13
0
05 Dec 2023
DreaMo: Articulated 3D Reconstruction From A Single Casual Video
Tao Tu
Ming-feng Li
C. Lin
Yen-Chi Cheng
Min Sun
Ming-Hsuan Yang
84
5
0
05 Dec 2023
Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent
Jianmeng Liu
Yuyao Zhang
Zeyuan Meng
Yu-Wing Tai
Chi-Keung Tang
VLM, DiffM, AI4CE
64
1
0
05 Dec 2023
Training on Synthetic Data Beats Real Data in Multimodal Relation Extraction
Zilin Du
Haoxin Li
Xu Guo
Boyang Li
93
1
0
05 Dec 2023
Retrieving Conditions from Reference Images for Diffusion Models
Haoran Tang
Xin Zhou
Jieren Deng
Zhihong Pan
Hao Tian
Pratik Chaudhari
77
2
0
05 Dec 2023
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Shan Zhong
Zhongzhan Huang
Shanghua Gao
Wushao Wen
Liang Lin
Marinka Zitnik
Pan Zhou
LLMAG, LRM
105
40
0
05 Dec 2023
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Zhangyang Qi
Ye Fang
Zeyi Sun
Xiaoyang Wu
Tong Wu
Jiaqi Wang
Dahua Lin
Hengshuang Zhao
MLLM
184
42
0
05 Dec 2023
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
Zhenyu Li
Shariq Farooq Bhat
Peter Wonka
MDE
87
25
0
04 Dec 2023
Readout Guidance: Learning Control from Diffusion Features
Grace Luo
Trevor Darrell
Oliver Wang
Dan B. Goldman
Aleksander Holynski
101
27
0
04 Dec 2023
Generative Powers of Ten
Xiaojuan Wang
Janne Kontkanen
Brian L. Curless
Steven M. Seitz
Ira Kemelmacher-Shlizerman
B. Mildenhall
Pratul P. Srinivasan
Dor Verbin
Aleksander Holynski
77
10
0
04 Dec 2023
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Bingxin Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM, MDE
142
173
0
04 Dec 2023
Style Aligned Image Generation via Shared Attention
Amir Hertz
Andrey Voynov
Shlomi Fruchter
Daniel Cohen-Or
DiffM
68
135
0
04 Dec 2023
ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation
Dar-Yen Chen
Hamish Tennent
Ching-Wen Hsu
DiffM
114
27
0
04 Dec 2023