ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization
Chao Yuan
Guiwei Zhang
Changxiao Ma
Tianyi Zhang
Guanglin Niu
OOD
85
3
0
02 Mar 2025
Learning to Animate Images from A Few Videos to Portray Delicate Human Actions
Haoxin Li
Yingchen Yu
Qilong Wu
Hanwang Zhang
Boyang Li
Song Bai
3DHVGen
499
0
0
01 Mar 2025
Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality
M. Yazdani
Yasamin Medghalchi
Pooria Ashrafian
Ilker Hacihaliloglu
Dena Shahriari
MedIm
67
0
0
01 Mar 2025
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
Yifei Qian
Zhongliang Guo
Bowen Deng
Chun Tong Lei
Shuai Zhao
Chun Pong Lau
Xiaopeng Hong
Michael P. Pound
DiffM
190
1
0
28 Feb 2025
Diffusion Restoration Adapter for Real-World Image Restoration
Diffusion Restoration Adapter for Real-World Image Restoration
Hanbang Liang
Zhen Wang
Weihui Deng
DiffM
79
1
0
28 Feb 2025
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
Yifei Xia
Suhan Ling
Fangcheng Fu
Yijiao Wang
Huixia Li
Xuefeng Xiao
Tengjiao Wang
VGen
149
11
0
28 Feb 2025
Attention Distillation: A Unified Approach to Visual Characteristics Transfer
Attention Distillation: A Unified Approach to Visual Characteristics Transfer
Yang Zhou
Xu Gao
Zichong Chen
Hui Huang
DiffM
113
7
0
27 Feb 2025
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Edo Kadosh
Nir Goren
Or Patashnik
Daniel Garibi
Daniel Cohen-Or
DiffM
116
0
0
27 Feb 2025
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
L. Chen
S. Bai
Wenhao Chai
Weichu Xie
Haozhe Zhao
Leon Vinci
Junyang Lin
Baobao Chang
DiffM
150
8
0
27 Feb 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
117
2
0
27 Feb 2025
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Yuhao Li
Mirana Claire Angel
Salman Khan
Yu Zhu
Jinqiu Sun
Yanning Zhang
Fahad Shahbaz Khan
VGen
103
1
0
27 Feb 2025
Image Referenced Sketch Colorization Based on Animation Creation Workflow
Image Referenced Sketch Colorization Based on Animation Creation Workflow
Dingkun Yan
Xinrui Wang
Zhuoru Li
Suguru Saito
Yusuke Iwasawa
Y. Matsuo
Jiaxian Guo
DiffM
100
0
0
27 Feb 2025
High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model
High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model
Mingtao Guo
Guanyu Xing
Yanli Liu
DiffMVGen
104
1
0
27 Feb 2025
One-for-More: Continual Diffusion Model for Anomaly Detection
One-for-More: Continual Diffusion Model for Anomaly Detection
Xiaofan Li
Xin Tan
Zhuo Chen
Zhizhong Zhang
Ruixin Zhang
...
Guanna Jiang
Yulong Chen
Yanyun Qu
Lizhuang Ma
Yuan Xie
DiffM
140
2
0
27 Feb 2025
GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors
GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors
An Li
Zhe Zhu
Mingqiang Wei
3DPC
126
0
0
27 Feb 2025
SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization
SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization
Shubhankar Borse
K. Bhardwaj
Mohammad Reza Karimi Dastjerdi
Hyojin Park
Shreya Kadambi
...
Prathamesh Mandke
Ankita Nayak
Harris Teague
Munawar Hayat
Fatih Porikli
DiffM
182
1
0
27 Feb 2025
InstaFace: Identity-Preserving Facial Editing with Single Image Inference
InstaFace: Identity-Preserving Facial Editing with Single Image Inference
MD Wahiduzzaman Khan
Mingshan Jia
Shaolin Zhang
En Yu
Caifeng Shan
Kaska Musial-Gabrys
DiffM
95
0
0
27 Feb 2025
Knowledge Bridger: Towards Training-free Missing Modality Completion
Knowledge Bridger: Towards Training-free Missing Modality Completion
Guanzhou Ke
Shengfeng He
Xinyu Wang
Bo Wang
Guoqing Chao
Yize Zhang
Yi Xie
HeXing Su
200
1
0
27 Feb 2025
Glad: A Streaming Scene Generator for Autonomous Driving
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
Xinming Zhang
3DGSVGen
112
3
0
26 Feb 2025
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
Lingzhou Mu
Baiji Liu
Ruonan Zhang
Guiming Mo
Jiawei Jin
Kai Zhang
Haozhi Huang
DiffMVGen
144
2
0
26 Feb 2025
Multi-Perspective Data Augmentation for Few-shot Object Detection
Multi-Perspective Data Augmentation for Few-shot Object Detection
Anh-Khoa Nguyen Vu
Quoc-Truong Truong
Vinh-Tiep Nguyen
T. Ngo
Thanh-Toan Do
Tam V. Nguyen
166
1
0
25 Feb 2025
PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching
PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching
Han Nie
B. Luo
Jun Liu
Z. Fu
Huan Zhou
Shuo Zhang
Weixing Liu
DiffMVLM
123
0
0
25 Feb 2025
A Survey of fMRI to Image Reconstruction
A Survey of fMRI to Image Reconstruction
Weiyu Guo
Guoying Sun
Jianxiang He
Tong Shao
Shaoguang Wang
Ziyang Chen
Meisheng Hong
Ying Sun
Hui Xiong
96
1
0
24 Feb 2025
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffMVGen
128
7
0
24 Feb 2025
GCC: Generative Color Constancy via Diffusing a Color Checker
GCC: Generative Color Constancy via Diffusing a Color Checker
Chen-Wei Chang
Cheng-De Fan
Chia-Che Chang
Yi-Chen Lo
Yu-Chee Tseng
Jiun-Long Huang
Yu-Lun Liu
167
0
0
24 Feb 2025
SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations
SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations
Wen Liu
Pei Yang
Wenhui Hong
Xiaoguang Mei
Jiayi Ma
DiffM
91
0
0
24 Feb 2025
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models
Shunchang Liu
Zhuan Shi
Lingjuan Lyu
Yaochu Jin
Boi Faltings
132
2
0
24 Feb 2025
A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis
A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis
Yuli Wu
Fucheng Liu
Rüveyda Yilmaz
Henning Konermann
Peter Walter
Johannes Stegmaier
EGVMMedIm
134
2
0
24 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Yongqian Li
Gordon Wetzstein
Ziwei Liu
Dahua Lin
MDEVGen
164
9
0
24 Feb 2025
X-Dancer: Expressive Music to Human Dance Video Generation
X-Dancer: Expressive Music to Human Dance Video Generation
Zeyuan Chen
Hongyi Xu
Guoxian Song
You Xie
Chenxu Zhang
Xiusi Chen
Chao Wang
Di Chang
Linjie Luo
VGen
82
1
0
24 Feb 2025
DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation
DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation
Yuxuan Xiong
Yue Shi
Yishun Dou
Bingbing Ni
DiffM
69
0
0
22 Feb 2025
3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation
3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation
Hansheng Chen
Bokui Shen
Yulin Liu
Ruoxi Shi
Linqi Zhou
Connor Z. Lin
Jiayuan Gu
H. Su
Gordon Wetzstein
Leonidas Guibas
189
4
0
21 Feb 2025
Image compositing is all you need for data augmentation
Image compositing is all you need for data augmentation
Ang Jia Ning Shermaine
Michalis Lazarou
Tania Stathaki
164
2
0
20 Feb 2025
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
Ziqiang Liu
Shuangrui Ding
Zhixiong Zhang
Xiaoyi Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Dahua Lin
Jiaqi Wang
132
3
0
18 Feb 2025
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection
Xuan Tong
Yang Chang
Qing Zhao
Jiawen Yu
Boyang Wang
...
Xinji Mai
Haoran Wang
Zeng Tao
Yan Wang
Wenqiang Zhang
114
1
0
17 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffMVGen
144
5
0
17 Feb 2025
FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion
FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion
Yufan Zhou
Haoyu Shen
Huan Wang
DiffM
260
1
0
17 Feb 2025
Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation
Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation
Zexi Jia
Chuanwei Huang
Hongyan Fei
Yeshuang Zhu
Zhiqiang Yuan
Jinchao Zhang
Jie Zhou
DiffMVLM
86
0
0
17 Feb 2025
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
Gyumin Shim
Sangmin Lee
Jaegul Choo
3DGS
106
0
0
17 Feb 2025
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
Junxian Ma
Shiwen Wang
Jian Yang
Junyi Hu
Jian Liang
Guosheng Lin
Jingbo Chen
Kai Li
Yu Meng
DiffMVGen
122
4
0
17 Feb 2025
TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
Yunfei Liu
Lei Zhu
Lijian Lin
Ye Zhu
Ailing Zhang
Yu Li
135
1
0
16 Feb 2025
Direct Preference Optimization-Enhanced Multi-Guided Diffusion Model for Traffic Scenario Generation
Direct Preference Optimization-Enhanced Multi-Guided Diffusion Model for Traffic Scenario Generation
Seungjun Yu
Kisung Kim
Daejung Kim
Haewook Han
Jinhan Lee
124
1
0
14 Feb 2025
SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models
SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models
Zhiyong Yang
Linye Lyu
Xuanhang Chang
Daojing He
Yu Li
109
0
0
14 Feb 2025
E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization
E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization
T. Pham
Zhang Kang
Ji Woo Hong
Xuran Zheng
Chang D. Yoo
136
0
0
13 Feb 2025
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
Zhenxing Mi
Kuan-Chieh Wang
Guocheng Qian
Hanrong Ye
Runtao Liu
Sergey Tulyakov
Kfir Aberman
Dan Xu
LRM
97
2
0
12 Feb 2025
Ultrasound Image Generation using Latent Diffusion Models
Ultrasound Image Generation using Latent Diffusion Models
Benoit Freiche
Anthony El-Khoury
Ali Nasiri-Sarvi
Mahdi S. Hosseini
Damien Garcia
Adrian Basarab
Mathieu Boily
Hassan Rivaz
MedIm
103
1
0
12 Feb 2025
MaRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers
MaRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers
Ao Li
Wei Fang
Hongbo Zhao
Le Lu
Ge Yang
Minfeng Xu
DiffM
111
1
0
11 Feb 2025
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Tai-Yu Pan
Sooyoung Jeon
Mengdi Fan
Jinsu Yoo
Zhenyang Feng
Mark E. Campbell
Kilian Q. Weinberger
Bharath Hariharan
Wei-Lun Chao
256
0
0
10 Feb 2025
MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models
MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models
Kamil Garifullin
Maxim Nikolaev
Andrey Kuznetsov
Aibek Alanov
129
0
0
10 Feb 2025
Beyond and Free from Diffusion: Invertible Guided Consistency Training
Chia-Hong Hsu
Shiu-hong Kao
Randall Balestriero
3DV
142
0
0
08 Feb 2025
Previous
123...121314...606162
Next