ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron
  Pruning
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
Ruchika Chavhan
Da Li
Timothy M. Hospedales
96
16
0
29 May 2024
Patch-enhanced Mask Encoder Prompt Image Generation
Patch-enhanced Mask Encoder Prompt Image Generation
Shusong Xu
Peiye Liu
DiffM
40
0
0
29 May 2024
SketchDeco: Decorating B&W Sketches with Colour
SketchDeco: Decorating B&W Sketches with Colour
Chaitat Utintu
Pinaki Nath Chowdhury
Aneeshan Sain
Subhadeep Koley
A. Bhunia
Yi-Zhe Song
DiffM
69
3
0
29 May 2024
Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map
  Filtering
Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering
Ido Sobol
Chenfeng Xu
Or Litany
DiffM
80
2
0
29 May 2024
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Lianghui Zhu
Zilong Huang
Bencheng Liao
Jun Hao Liew
Hanshu Yan
Jiashi Feng
Xinggang Wang
138
17
0
28 May 2024
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via
  Diffusion Transformers
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Jun Zheng
Fuwei Zhao
Youjiang Xu
Xin Dong
Xiaodan Liang
VGenDiffM
69
7
0
28 May 2024
Multi-modal Generation via Cross-Modal In-Context Learning
Multi-modal Generation via Cross-Modal In-Context Learning
Amandeep Kumar
Muzammal Naseer
Sanath Narayan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
MLLM
92
1
0
28 May 2024
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across
  Any Scenario
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
Yuhan Li
Hao Zhou
Wenxiang Shang
Ran Lin
Xuanhong Chen
Bingbing Ni
DiffM
54
5
0
28 May 2024
EG4D: Explicit Generation of 4D Object without Score Distillation
EG4D: Explicit Generation of 4D Object without Score Distillation
Qi Sun
Zhiyang Guo
Bo Liu
Jing Nathan Yan
Shengming Yin
Wen-gang Zhou
Jing Liao
Houqiang Li
VGen3DGS
109
15
0
28 May 2024
Text Modality Oriented Image Feature Extraction for Detecting
  Diffusion-based DeepFake
Text Modality Oriented Image Feature Extraction for Detecting Diffusion-based DeepFake
Di Yang
Yihao Huang
Qing Guo
Felix Juefei Xu
Xiaojun Jia
Run Wang
G. Pu
Yang Liu
DiffM
64
0
0
28 May 2024
ToonCrafter: Generative Cartoon Interpolation
ToonCrafter: Generative Cartoon Interpolation
Jinbo Xing
Hanyuan Liu
Menghan Xia
Yong Zhang
Xintao Wang
Ying Shan
Tien-Tsin Wong
119
33
0
28 May 2024
Diffusion Model Patching via Mixture-of-Prompts
Diffusion Model Patching via Mixture-of-Prompts
Seokil Ham
Sangmin Woo
Jin-Young Kim
Hyojun Go
Byeongjun Park
Changick Kim
VLM
81
2
0
28 May 2024
MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding
  via fMRI
MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI
Inhwa Han
Jaayeon Lee
Jong Chul Ye
MedImAI4CE
90
1
0
28 May 2024
3D StreetUnveiler with Semantic-aware 2DGS -- a simple baseline
3D StreetUnveiler with Semantic-aware 2DGS -- a simple baseline
Jingwei Xu
Yikai Wang
Yiqun Zhao
Yanwei Fu
Shenghua Gao
3DGS
133
2
0
28 May 2024
Collaborative Video Diffusion: Consistent Multi-video Generation with
  Camera Control
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
Zhengfei Kuang
Shengqu Cai
Hao He
Yinghao Xu
Hongsheng Li
Leonidas Guibas
Gordon Wetzstein
VGenDiffM
112
38
0
27 May 2024
Human4DiT: Free-view Human Video Generation with 4D Diffusion
  Transformer
Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer
Ruizhi Shao
Youxin Pang
Zerong Zheng
Jingxiang Sun
Yebin Liu
VGen
103
21
0
27 May 2024
RB-Modulation: Training-Free Personalization of Diffusion Models using
  Stochastic Optimal Control
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Litu Rout
Yujia Chen
Nataniel Ruiz
Abhishek Kumar
Constantine Caramanis
Sanjay Shakkottai
Wen-Sheng Chu
DiffM
98
26
0
27 May 2024
Does Diffusion Beat GAN in Image Super Resolution?
Does Diffusion Beat GAN in Image Super Resolution?
Denis Kuznedelev
Valerii Startsev
Daniil Shlenskii
Sergey Kastryulin
78
4
0
27 May 2024
PatchScaler: An Efficient Patch-Independent Diffusion Model for
  Super-Resolution
PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution
Yong Liu
Hang Dong
Jinshan Pan
Qingji Dong
Kai-xiang Chen
Rongxiang Zhang
Lean Fu
Fei Wang
DiffM
80
1
0
27 May 2024
Training-free Editioning of Text-to-Image Models
Training-free Editioning of Text-to-Image Models
Jinqi Wang
Yunfei Fu
Zhangcan Ding
Bailin Deng
Yu-Kun Lai
Yipeng Qin
DiffMVLM
66
0
0
27 May 2024
From Obstacle to Opportunity: Enhancing Semi-supervised Learning with
  Synthetic Data
From Obstacle to Opportunity: Enhancing Semi-supervised Learning with Synthetic Data
Zerun Wang
Jiafeng Mao
Liuyu Xiang
Toshihiko Yamasaki
84
0
0
27 May 2024
Transfer Learning for Diffusion Models
Transfer Learning for Diffusion Models
Yidong Ouyang
Liyan Xie
Hongyuan Zha
Guang Cheng
DiffM
127
3
0
27 May 2024
TIE: Revolutionizing Text-based Image Editing for Complex-Prompt
  Following and High-Fidelity Editing
TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing
Xinyu Zhang
Mengxue Kang
Fei Wei
Shuang Xu
Yuhe Liu
Lin Ma
MLLMDiffM
77
2
0
27 May 2024
Balancing User Preferences by Social Networks: A Condition-Guided Social
  Recommendation Model for Mitigating Popularity Bias
Balancing User Preferences by Social Networks: A Condition-Guided Social Recommendation Model for Mitigating Popularity Bias
Xingbo He
Wenqi Fan
Ruobing Wang
Yili Wang
Ying Wang
Shirui Pan
Xin Wang
CML
72
2
0
27 May 2024
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Jiannan Huang
Jun Hao Liew
Hanshu Yan
Yuyang Yin
Yao Zhao
Yunchao Wei
Yunchao Wei
DiffM
209
7
0
27 May 2024
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Kai Wang
Yukun Zhou
Mingjia Shi
Zhihang Yuan
Yuzhang Shang
Yuzhang Shang
Hanwang Zhang
Hanwang Zhang
Yang You
164
14
0
27 May 2024
CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild
CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild
Xingqun Qi
Hengyuan Zhang
Yatian Wang
J. Pan
Chen Liu
...
Qixun Zhang
Shanghang Zhang
Wenhan Luo
Qifeng Liu
Qi-fei Liu
DiffMSLR
189
7
0
27 May 2024
Protect-Your-IP: Scalable Source-Tracing and Attribution against
  Personalized Generation
Protect-Your-IP: Scalable Source-Tracing and Attribution against Personalized Generation
Runyi Li
Xuanyu Zhang
Zhipei Xu
Yongbing Zhang
Jian Zhang
WIGM
88
4
0
26 May 2024
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling
F. Babiloni
Alexandros Lattas
Jiankang Deng
Stefanos Zafeiriou
DiffM
100
4
0
26 May 2024
Underwater Image Enhancement by Diffusion Model with Customized
  CLIP-Classifier
Underwater Image Enhancement by Diffusion Model with Customized CLIP-Classifier
Shuaixin Liu
Kunqian Li
Yilin Ding
Qi Qi
55
5
0
25 May 2024
C3LLM: Conditional Multimodal Content Generation Using Large Language
  Models
C3LLM: Conditional Multimodal Content Generation Using Large Language Models
Zixuan Wang
Qinkai Duan
Yu-Wing Tai
Chi-Keung Tang
118
3
0
25 May 2024
Reliable Source Approximation: Source-Free Unsupervised Domain
  Adaptation for Vestibular Schwannoma MRI Segmentation
Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation
Hongye Zeng
Ke Zou
Zhihao Chen
Ru Zheng
Huazhu Fu
MedImDiffM
90
7
0
25 May 2024
FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
Ke Fan
Junshu Tang
Weijian Cao
Ran Yi
Moran Li
Jing-yu Gong
Jiangning Zhang
Yabiao Wang
Chengjie Wang
Lizhuang Ma
116
19
0
24 May 2024
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar
  Generation
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
Yuchi Wang
Junliang Guo
Jianhong Bai
Runyi Yu
Tianyu He
Xu Tan
Xu Sun
Jiang Bian
DiffM
90
11
0
24 May 2024
Bridging The Gap between Low-rank and Orthogonal Adaptation via
  Householder Reflection Adaptation
Bridging The Gap between Low-rank and Orthogonal Adaptation via Householder Reflection Adaptation
Shen Yuan
Haotian Liu
Hongteng Xu
81
5
0
24 May 2024
Semantic Aware Diffusion Inverse Tone Mapping
Semantic Aware Diffusion Inverse Tone Mapping
Abhishek Goswami
Aru Ranjan Singh
Francesco Banterle
Kurt Debattista
Thomas Bashford-Rogers
DiffM
78
3
0
24 May 2024
SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance
SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance
Guibao Shen
Luozhou Wang
Jiantao Lin
Wenhang Ge
Chaozhe Zhang
...
Pengfei Wan
Zhong-ming Wang
Guangyong Chen
Yijun Li
Yingcong Chen
64
10
0
24 May 2024
Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion
Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion
Aoxue Li
Mingyang Yi
Zhenguo Li
DiffM
79
0
0
24 May 2024
StyleMaster: Towards Flexible Stylized Image Generation with Diffusion
  Models
StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models
Chengming Xu
Kai Hu
Donghao Luo
Jiangning Zhang
Wei Li
Yanhao Ge
Chengjie Wang
DiffM
73
0
0
24 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Haifeng Zhang
Mingsheng Long
VGen
114
40
0
24 May 2024
ODGEN: Domain-specific Object Detection Data Generation with Diffusion
  Models
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu
Shiyu Li
Yuxuan Liu
Ping Huang
Jiulong Shan
Huimin Ma
Jian Yuan
84
6
0
24 May 2024
Learning Invariant Causal Mechanism from Vision-Language Models
Learning Invariant Causal Mechanism from Vision-Language Models
Changwen Zheng
Siyu Zhao
Xingyu Zhang
Jiangmeng Li
Changwen Zheng
Jingyao Wang
CMLBDLVLM
129
0
0
24 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
149
127
0
23 May 2024
Semantica: An Adaptable Image-Conditioned Diffusion Model
Semantica: An Adaptable Image-Conditioned Diffusion Model
Manoj Kumar
N. Houlsby
Emiel Hoogeboom
DiffMVLM
103
0
0
23 May 2024
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion
  Transformer
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
Shuang Wu
Youtian Lin
Feihu Zhang
Yifei Zeng
Jingxi Xu
Philip Torr
Xun Cao
Yao Yao
110
63
0
23 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image
  Editing
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
86
16
0
23 May 2024
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible
  Pose Control
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control
Yong Zhong
Min Zhao
Zebin You
Xiaofeng Yu
Changwang Zhang
Chongxuan Li
DiffM
107
6
0
23 May 2024
Regressor-free Molecule Generation to Support Drug Response Prediction
Regressor-free Molecule Generation to Support Drug Response Prediction
Kun Li
Xiuwen Gong
Shirui Pan
Hongzhi Zhang
Bo Du
Wenbin Hu
70
1
0
23 May 2024
Survey on Visual Signal Coding and Processing with Generative Models:
  Technologies, Standards and Optimization
Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization
Zhibo Chen
Heming Sun
Li Zhang
Fan Zhang
110
3
0
23 May 2024
FreeTuner: Any Subject in Any Style with Training-free Diffusion
FreeTuner: Any Subject in Any Style with Training-free Diffusion
Youcan Xu
Zhen Wang
Jun Xiao
Wei Liu
Long Chen
DiffM
74
11
0
23 May 2024
Previous
123...313233...606162
Next