ResearchTrend.AI
Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
All in One Framework for Multimodal Re-identification in the Wild
He Li
Mang Ye
Ming Zhang
Bo Du
83
11
0
08 May 2024
TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model
Yongming Zhang
Tianyu Zhang
Haoran Xie
DiffM
59
0
0
07 May 2024
Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing
Yi Zuo
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Shuyuan Yang
Yuwei Guo
VGen, DiffM
90
1
0
07 May 2024
Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
Jihyun Kim
Changjae Oh
Hoseok Do
Soohyun Kim
Kwanghoon Sohn
DiffM
91
12
0
07 May 2024
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Zhuoyi Yang
Heyang Jiang
Wenyi Hong
Jiayan Teng
Wendi Zheng
Yuxiao Dong
Ming Ding
Jie Tang
SupR
65
6
0
07 May 2024
Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
Fan Bao
Chendong Xiang
Gang Yue
Guande He
Hongzhou Zhu
Kaiwen Zheng
Min Zhao
Shilong Liu
Yaole Wang
Jun Zhu
VGen
193
73
0
07 May 2024
An Empty Room is All We Want: Automatic Defurnishing of Indoor Panoramas
Mira Slavcheva
Dave Gausebeck
Kevin Chen
David Buchhofer
Azwad Sabik
Chen Ma
Sachal Dhillon
Olaf Brandt
Alan Dolhasz
66
6
0
06 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen, LM&Ro
179
48
0
06 May 2024
Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review
Anurag Dalal
Daniel Hagen
K. Robbersmyr
Kristian Muri Knausgård
GP, 3DV, 3DGS
92
26
0
06 May 2024
Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior Knowledge
Lemuel Puglisi
Daniel C. Alexander
Daniele Ravi
MedIm
61
12
0
06 May 2024
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
147
16
0
06 May 2024
DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model
Peijing Jia
Tuopu Wen
Ziang Luo
Mengmeng Yang
Kun Jiang
...
Ziyuan Liu
Le Cui
Kehua Sheng
Bo Zhang
Diange Yang
96
4
0
03 May 2024
AI-generated art perceptions with GenFrame -- an image-generating picture frame
Peter Kun
Matthias Anton Freiberger
A. Løvlie
Sebastian Risi
66
2
0
03 May 2024
Customizing Text-to-Image Models with a Single Image Pair
Maxwell Jones
Sheng-Yu Wang
Nupur Kumari
David Bau
Jun-Yan Zhu
DiffM
103
21
0
02 May 2024
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Yupeng Zhou
Daquan Zhou
Ming-Ming Cheng
Jiashi Feng
Qibin Hou
DiffM, VGen
124
101
0
02 May 2024
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance
Kelvin C. K. Chan
Yang Zhao
Xuhui Jia
Ming-Hsuan Yang
Huisheng Wang
114
3
0
02 May 2024
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines
Ye Tian
Zhen Jia
Ziyue Luo
Yida Wang
Chuan Wu
AI4CE
48
4
0
02 May 2024
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Yiwei Ma
Zhekai Lin
Jiayi Ji
Yijun Fan
Xiaoshuai Sun
Rongrong Ji
106
7
0
02 May 2024
Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers
Palawat Busaranuvong
Emmanuel O. Agu
Deepak Kumar
Shefalika Gautam
Reza Saadati Fard
B. Tulu
Diane Strong
MedIm
67
0
0
01 May 2024
GraCo: Granularity-Controllable Interactive Segmentation
Yian Zhao
Kehan Li
Ze-Long Cheng
Pengchong Qiao
Xiawu Zheng
Rongrong Ji
Chang Liu
Li-ming Yuan
Jie Chen
116
9
0
01 May 2024
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu
Yiming Hao
Manyuan Zhang
Keqiang Sun
Zhaoyang Huang
Guanglu Song
Yu Liu
Hongsheng Li
EGVM
127
25
0
01 May 2024
UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement
Ruiquan Ge
Zhaojie Fang
Pengxue Wei
Zhanghao Chen
Hongyang Jiang
Ahmed Elazab
Wangting Li
Xiang Wan
Shaochong Zhang
Changmiao Wang
MedIm
45
5
0
01 May 2024
Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable
Haozhe Liu
Wentian Zhang
Bing Li
Bernard Ghanem
Jürgen Schmidhuber
DiffM, WIGM, AAML
85
1
0
01 May 2024
ASAM: Boosting Segment Anything Model with Adversarial Tuning
Bo Li
Haoke Xiao
Lv Tang
107
11
0
01 May 2024
Semantically Consistent Video Inpainting with Conditional Diffusion Models
Dylan Green
William Harvey
Saeid Naderiparizi
Matthew Niedoba
Yunpeng Liu
...
Vasileios Lioutas
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank Wood
DiffM
113
1
0
30 Apr 2024
PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios
Jingbo Wang
Zhengyi Luo
Ye Yuan
Yixuan Li
Bo Dai
102
15
0
30 Apr 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
86
0
0
30 Apr 2024
NeRF-Insert: 3D Local Editing with Multimodal Control Signals
Benet Oriol Sabat
Alessandro Achille
Matthew Trager
Stefano Soatto
69
2
0
30 Apr 2024
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
Minghao Chen
Iro Laina
Andrea Vedaldi
3DGS
98
29
0
29 Apr 2024
A Survey on Diffusion Models for Time Series and Spatio-Temporal Data
Yiyuan Yang
Ming Jin
Haomin Wen
Chaoli Zhang
Yuxuan Liang
...
Bin Yang
Zenglin Xu
Jiang Bian
Shirui Pan
Qingsong Wen
DiffM, AI4TS, SyDa
130
45
0
29 Apr 2024
Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior
Zhiyuan Li
Yanhui Zhou
Hao Wei
Chenyang Ge
Jingwen Jiang
DiffM
89
15
0
29 Apr 2024
PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images
Jiquan Yuan
Fanyi Yang
Jihe Li
Xinyan Cao
Jinming Che
Jinlong Lin
Xixin Cao
EGVM
79
2
0
29 Apr 2024
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Junhao Cheng
Baiqiao Yin
Kaixin Cai
Minbin Huang
Hanhui Li
...
Yue Li
Yifei Li
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM, MLLM
138
13
0
29 Apr 2024
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
Tianyidan Xie
Rui Ma
Qian Wang
Xiaoqian Ye
Feixuan Liu
Ying Tai
Zhenyu Zhang
Lanjun Wang
Zili Yi
DiffM, MLLM
106
2
0
29 Apr 2024
DM-Align: Leveraging the Power of Natural Language Instructions to Make Changes to Images
Maria Mihaela Truşcǎ
Tinne Tuytelaars
Marie-Francine Moens
DiffM
79
1
0
27 Apr 2024
Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission
Mingyu Yang
Bowen Liu
Boyang Wang
Hun-Seok Kim
DiffM
111
6
0
27 Apr 2024
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
Ziyue Zhang
Mingbao Lin
Rongrong Ji
DiffM
144
3
0
26 Apr 2024
TELA: Text to Layer-wise 3D Clothed Human Generation
Junting Dong
Qi Fang
Zehuan Huang
Xudong Xu
Jingbo Wang
Sida Peng
Bo Dai
3DH
55
10
0
25 Apr 2024
Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Han Wang
Xinning Chai
Yiwen Wang
Yuhong Zhang
Rong Xie
Li Song
DiffM
68
2
0
25 Apr 2024
AudioScenic: Audio-Driven Video Scene Editing
Kaixin Shen
Ruijie Quan
Linchao Zhu
Jun Xiao
Yi Yang
VGen, DiffM
69
1
0
25 Apr 2024
OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
Ye Mao
Junpeng Jing
K. Mikolajczyk
VLM
52
0
0
25 Apr 2024
Interactive3D: Create What You Want by Interactive 3D Generation
Shaocong Dong
Lihe Ding
Zhanpeng Huang
Zibin Wang
Tianfan Xue
Dan Xu
58
10
0
25 Apr 2024
CoCoG: Controllable Visual Stimuli Generation based on Human Concept Representations
Chen Wei
Jiachen Zou
Dietmar Heinke
Quanying Liu
87
3
0
25 Apr 2024
SynCellFactory: Generative Data Augmentation for Cell Tracking
Moritz Sturm
Lorenzo Cerrone
Fred A. Hamprecht
86
4
0
25 Apr 2024
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Xiaomin Yu
Yezhaohui Wang
Yanfang Chen
Zhen Tao
Dinghao Xi
Shichao Song
Pengnian Qi
Zhiyu Li
135
10
0
25 Apr 2024
An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape
Sifat Muhammad Abdullah
Aravind Cheruvu
Shravya Kanchi
Taejoong Chung
Peng Gao
Murtuza Jadliwala
Bimal Viswanath
AAML
96
18
0
24 Apr 2024
Editable Image Elements for Controllable Synthesis
Jiteng Mu
Michael Gharbi
Richard Zhang
Eli Shechtman
Nuno Vasconcelos
Xiaolong Wang
Taesung Park
DiffM
92
9
0
24 Apr 2024
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Zinan Guo
Yanze Wu
Zhuowei Chen
Lang Chen
Qian He
DiffM
105
66
0
24 Apr 2024
Learning Long-form Video Prior via Generative Pre-Training
Jinheng Xie
Jiajun Feng
Zhaoxu Tian
Kevin Qinghong Lin
Yawen Huang
...
Nanxu Gong
Xu Zuo
Jiaqi Yang
Yefeng Zheng
Mike Zheng Shou
69
6
0
24 Apr 2024
Sketch2Human: Deep Human Generation with Disentangled Geometry and Appearance Control
Linzi Qu
Jiaxiang Shang
Hui Ye
Xiaoguang Han
Hongbo Fu
3DV
83
0
0
24 Apr 2024