ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes
S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes
Xingyi Li
Zhiguo Cao
Yizheng Wu
Kewei Wang
Ke Xian
Zhe Wang
Guo-Shing Lin
89
3
0
10 Mar 2024
Text-to-Audio Generation Synchronized with Videos
Text-to-Audio Generation Synchronized with Videos
Shentong Mo
Jing Shi
Yapeng Tian
DiffMVGen
92
18
0
08 Mar 2024
Audio-Synchronized Visual Animation
Audio-Synchronized Visual Animation
Lin Zhang
Shentong Mo
Yijing Zhang
Pedro Morgado
DiffM
101
20
0
08 Mar 2024
VideoElevator: Elevating Video Generation Quality with Versatile
  Text-to-Image Diffusion Models
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang
Yuxiang Wei
Xianhui Lin
Zheng Hui
Peiran Ren
Xuansong Xie
Xiangyang Ji
Wangmeng Zuo
VGen
87
7
0
08 Mar 2024
Towards Effective Usage of Human-Centric Priors in Diffusion Models for
  Text-based Human Image Generation
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Junyan Wang
Zhenhong Sun
Zhiyu Tan
Xuanbai Chen
Weihua Chen
Hao Li
Cheng Zhang
Yang Song
96
12
0
08 Mar 2024
Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
Liting Lin
Heng Fan
Zhipeng Zhang
Yaowei Wang
Yong-mei Xu
Haibin Ling
129
36
0
08 Mar 2024
Improving Diffusion Models for Virtual Try-on
Improving Diffusion Models for Virtual Try-on
Yisol Choi
Sangkyung Kwak
Kyungmin Lee
Hyungwon Choi
Jinwoo Shin
DiffM
104
29
0
08 Mar 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
124
103
0
08 Mar 2024
Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation
Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation
Yifan Mao
Jian Liu
Xianming Liu
DiffMMDE
74
3
0
08 Mar 2024
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Yunpeng Qu
Kun Yuan
Kai Zhao
Qizhi Xie
Jinhua Hao
Ming Sun
Chao Zhou
88
19
0
08 Mar 2024
InstructGIE: Towards Generalizable Image Editing
InstructGIE: Towards Generalizable Image Editing
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
99
9
0
08 Mar 2024
StereoDiffusion: Training-Free Stereo Image Generation Using Latent
  Diffusion Models
StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models
Lezhong Wang
J. Frisvad
Mark Bo Jensen
Siavash Bigdeli
DiffM
74
12
0
08 Mar 2024
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Hitesh Kandala
Jianfeng Gao
Jianwei Yang
VGenDiffM
88
3
0
07 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
134
38
0
07 Mar 2024
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on
  Noise Cropping and Merging
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging
Takahiro Shirakawa
Seiichi Uchida
DiffM
62
19
0
06 Mar 2024
Towards Understanding Cross and Self-Attention in Stable Diffusion for
  Text-Guided Image Editing
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Bingyan Liu
Chengyu Wang
Tingfeng Cao
Kui Jia
Jun Huang
DiffM
82
63
0
06 Mar 2024
Neural Image Compression with Text-guided Encoding for both Pixel-level
  and Perceptual Fidelity
Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
Hagyeong Lee
Minkyu Kim
Jun-Hyuk Kim
Seungeon Kim
Dokwan Oh
Jaeho Lee
DiffM
90
6
0
05 Mar 2024
Cross-Domain Image Conversion by CycleDM
Cross-Domain Image Conversion by CycleDM
Sho Shimotsumagari
Shumpei Takezaki
Daichi Haraguchi
Seiichi Uchida
81
0
0
05 Mar 2024
Tuning-Free Noise Rectification for High Fidelity Image-to-Video
  Generation
Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation
Weijie Li
Litong Gong
Yiran Zhu
Fanda Fan
Biao Wang
Tiezheng Ge
Bo Zheng
VGenDiffM
60
3
0
05 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
99
26
0
05 Mar 2024
Semantic Human Mesh Reconstruction with Textures
Semantic Human Mesh Reconstruction with Textures
Xiaoyu Zhan
Jianxin Yang
Yuanqi Li
Jie Guo
Yanwen Guo
Wenping Wang
3DH
83
2
0
05 Mar 2024
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video
  Diffusion Models via Training-Free Unified Attention Control
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
Xuweiyi Chen
Tian Xia
Sihan Xu
VGenDiffM
105
8
0
04 Mar 2024
DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with
  Non-linear Prediction
DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction
Weiyi Lv
Yuhang Huang
Ning Zhang
Ruei-Sung Lin
Mei Han
Dan Zeng
DiffM
138
20
0
04 Mar 2024
PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
Zheng Lv
Yuxiang Wei
Wangmeng Zuo
Kwan-Yee K. Wong
89
17
0
04 Mar 2024
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Lukas Höllein
Aljavz Bovzivc
Norman Muller
David Novotny
Hung-Yu Tseng
Christian Richardt
Michael Zollhöfer
Matthias Nießner
DiffM
88
45
0
04 Mar 2024
AtomoVideo: High Fidelity Image-to-Video Generation
AtomoVideo: High Fidelity Image-to-Video Generation
Litong Gong
Yiran Zhu
Weijie Li
Xiaoyang Kang
Biao Wang
Tiezheng Ge
Bo Zheng
DiffMVGen
196
12
0
04 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable
  Virtual Try-on
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
97
66
0
04 Mar 2024
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Supreeth Narasimhaswamy
Uttaran Bhattacharya
Xiang Chen
Ishita Dasgupta
Saayan Mitra
Minh Hoai
DiffM
75
25
0
04 Mar 2024
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
Jiaxiang Cheng
Pan Xie
Xin Xia
Jiashi Li
Jie Wu
Yuxi Ren
Huixia Li
Xuefeng Xiao
Min Zheng
Lean Fu
114
12
0
04 Mar 2024
APISR: Anime Production Inspired Real-World Anime Super-Resolution
APISR: Anime Production Inspired Real-World Anime Super-Resolution
Boyang Wang
Fengyu Yang
Xihang Yu
Chao Zhang
Han Zhao
SupR
91
13
0
03 Mar 2024
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
Zijin Yin
Kongming Liang
Bing Li
Zhanyu Ma
Jun Guo
VLM
134
2
0
02 Mar 2024
Face Swap via Diffusion Model
Face Swap via Diffusion Model
Feifei Wang
DiffM
59
1
0
02 Mar 2024
Rethinking Inductive Biases for Surface Normal Estimation
Rethinking Inductive Biases for Surface Normal Estimation
Gwangbin Bae
Andrew J. Davison
125
52
0
01 Mar 2024
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Yuhao Liu
Zhanghan Ke
Fang Liu
Nanxuan Zhao
Rynson W. H. Lau
DiffM
114
23
0
01 Mar 2024
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on
  its Contour-following Ability
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability
Wenjie Xuan
Yufei Xu
Shanshan Zhao
Chaoyue Wang
Juhua Liu
Bo Du
Dacheng Tao
86
3
0
01 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
80
6
0
01 Mar 2024
A Novel Approach to Industrial Defect Generation through Blended Latent
  Diffusion Model with Online Adaptation
A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation
Hanxi Li
Zhengxun Zhang
Hao Chen
Lin Wu
Bo Li
Deyin Liu
Mingwen Wang
81
3
0
29 Feb 2024
Mirage: Cross-Embodiment Zero-Shot Policy Transfer with Cross-Painting
Mirage: Cross-Embodiment Zero-Shot Policy Transfer with Cross-Painting
Lawrence Yunliang Chen
Kush Hari
K. Dharmarajan
Chenfeng Xu
Quan Vuong
Ken Goldberg
148
23
0
29 Feb 2024
OHTA: One-shot Hand Avatar via Data-driven Implicit Priors
OHTA: One-shot Hand Avatar via Data-driven Implicit Priors
Xiaozheng Zheng
Chao Wen
Zhuo Su
Zeran Xu
Zhaohu Li
Yang Zhao
Zhou Xue
77
7
0
29 Feb 2024
ViewFusion: Towards Multi-View Consistency via Interpolated Denoising
ViewFusion: Towards Multi-View Consistency via Interpolated Denoising
Xianghui Yang
Yan Zuo
Sameera Ramasinghe
Loris Bazzani
Gil Avraham
Anton Van Den Hengel
DiffM
80
7
0
29 Feb 2024
From Summary to Action: Enhancing Large Language Models for Complex
  Tasks with Open World APIs
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs
Yulong Liu
Yunlong Yuan
Chunwei Wang
Jianhua Han
Yongqiang Ma
Li Zhang
Nanning Zheng
Hang Xu
LLMAG
58
5
0
28 Feb 2024
Context-aware Talking Face Video Generation
Context-aware Talking Face Video Generation
Meidai Xuanyuan
Yuwang Wang
Honglei Guo
Qionghai Dai
DiffM
75
0
0
28 Feb 2024
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Yanzuo Lu
Manlin Zhang
Andy J. Ma
Xiaohua Xie
Jian-Huang Lai
DiffM
68
24
0
28 Feb 2024
SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images
  via Vision-Language Model
SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Bin Cao
Jianhao Yuan
Yexin Liu
Jian Li
Shuyang Sun
Jing Liu
Bo Zhao
DiffM
108
9
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
290
22
0
28 Feb 2024
Box It to Bind It: Unified Layout Control and Attribute Binding in T2I
  Diffusion Models
Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Hamid Laga
F. Boussaïd
DiffM
77
6
0
27 Feb 2024
Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning
Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning
Xiaoyu Zhang
Matthew Chang
Pranav Kumar
Saurabh Gupta
DiffMOffRL
117
15
0
27 Feb 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image
  Synthesis and Editing
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
83
3
0
27 Feb 2024
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized
  Diffusion Models
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
Shyam Marjit
Harshit Singh
Nityanand Mathur
Sayak Paul
Chia-Mu Yu
Pin-Yu Chen
DiffM
77
7
0
27 Feb 2024
SDDGR: Stable Diffusion-based Deep Generative Replay for Class
  Incremental Object Detection
SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection
Junsu Kim
Hoseong Cho
Jihyeon Kim
Yihalem Yimolal Tiruneh
Seungryul Baek
DiffM
137
21
0
27 Feb 2024
Previous
123...404142...606162
Next