Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
Ruchika Chavhan
Da Li
Timothy M. Hospedales
96
16
0
29 May 2024
Patch-enhanced Mask Encoder Prompt Image Generation
Shusong Xu
Peiye Liu
DiffM
40
0
0
29 May 2024
SketchDeco: Decorating B&W Sketches with Colour
Chaitat Utintu
Pinaki Nath Chowdhury
Aneeshan Sain
Subhadeep Koley
A. Bhunia
Yi-Zhe Song
DiffM
69
3
0
29 May 2024
Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering
Ido Sobol
Chenfeng Xu
Or Litany
DiffM
80
2
0
29 May 2024
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Lianghui Zhu
Zilong Huang
Bencheng Liao
Jun Hao Liew
Hanshu Yan
Jiashi Feng
Xinggang Wang
138
17
0
28 May 2024
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Jun Zheng
Fuwei Zhao
Youjiang Xu
Xin Dong
Xiaodan Liang
VGen
DiffM
69
7
0
28 May 2024
Multi-modal Generation via Cross-Modal In-Context Learning
Amandeep Kumar
Muzammal Naseer
Sanath Narayan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
MLLM
92
1
0
28 May 2024
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
Yuhan Li
Hao Zhou
Wenxiang Shang
Ran Lin
Xuanhong Chen
Bingbing Ni
DiffM
54
5
0
28 May 2024
EG4D: Explicit Generation of 4D Object without Score Distillation
Qi Sun
Zhiyang Guo
Bo Liu
Jing Nathan Yan
Shengming Yin
Wen-gang Zhou
Jing Liao
Houqiang Li
VGen
3DGS
109
15
0
28 May 2024
Text Modality Oriented Image Feature Extraction for Detecting Diffusion-based DeepFake
Di Yang
Yihao Huang
Qing Guo
Felix Juefei Xu
Xiaojun Jia
Run Wang
G. Pu
Yang Liu
DiffM
64
0
0
28 May 2024
ToonCrafter: Generative Cartoon Interpolation
Jinbo Xing
Hanyuan Liu
Menghan Xia
Yong Zhang
Xintao Wang
Ying Shan
Tien-Tsin Wong
119
33
0
28 May 2024
Diffusion Model Patching via Mixture-of-Prompts
Seokil Ham
Sangmin Woo
Jin-Young Kim
Hyojun Go
Byeongjun Park
Changick Kim
VLM
81
2
0
28 May 2024
MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI
Inhwa Han
Jaayeon Lee
Jong Chul Ye
MedIm
AI4CE
90
1
0
28 May 2024
3D StreetUnveiler with Semantic-aware 2DGS -- a simple baseline
Jingwei Xu
Yikai Wang
Yiqun Zhao
Yanwei Fu
Shenghua Gao
3DGS
133
2
0
28 May 2024
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
Zhengfei Kuang
Shengqu Cai
Hao He
Yinghao Xu
Hongsheng Li
Leonidas Guibas
Gordon Wetzstein
VGen
DiffM
112
38
0
27 May 2024
Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer
Ruizhi Shao
Youxin Pang
Zerong Zheng
Jingxiang Sun
Yebin Liu
VGen
103
21
0
27 May 2024
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Litu Rout
Yujia Chen
Nataniel Ruiz
Abhishek Kumar
Constantine Caramanis
Sanjay Shakkottai
Wen-Sheng Chu
DiffM
98
26
0
27 May 2024
Does Diffusion Beat GAN in Image Super Resolution?
Denis Kuznedelev
Valerii Startsev
Daniil Shlenskii
Sergey Kastryulin
78
4
0
27 May 2024
PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution
Yong Liu
Hang Dong
Jinshan Pan
Qingji Dong
Kai-xiang Chen
Rongxiang Zhang
Lean Fu
Fei Wang
DiffM
80
1
0
27 May 2024
Training-free Editioning of Text-to-Image Models
Jinqi Wang
Yunfei Fu
Zhangcan Ding
Bailin Deng
Yu-Kun Lai
Yipeng Qin
DiffM
VLM
66
0
0
27 May 2024
From Obstacle to Opportunity: Enhancing Semi-supervised Learning with Synthetic Data
Zerun Wang
Jiafeng Mao
Liuyu Xiang
Toshihiko Yamasaki
84
0
0
27 May 2024
Transfer Learning for Diffusion Models
Yidong Ouyang
Liyan Xie
Hongyuan Zha
Guang Cheng
DiffM
127
3
0
27 May 2024
TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing
Xinyu Zhang
Mengxue Kang
Fei Wei
Shuang Xu
Yuhe Liu
Lin Ma
MLLM
DiffM
77
2
0
27 May 2024
Balancing User Preferences by Social Networks: A Condition-Guided Social Recommendation Model for Mitigating Popularity Bias
Xingbo He
Wenqi Fan
Ruobing Wang
Yili Wang
Ying Wang
Shirui Pan
Xin Wang
CML
72
2
0
27 May 2024
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Jiannan Huang
Jun Hao Liew
Hanshu Yan
Yuyang Yin
Yao Zhao
Yunchao Wei
Yunchao Wei
DiffM
209
7
0
27 May 2024
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Kai Wang
Yukun Zhou
Mingjia Shi
Zhihang Yuan
Yuzhang Shang
Yuzhang Shang
Hanwang Zhang
Hanwang Zhang
Yang You
164
14
0
27 May 2024
CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild
Xingqun Qi
Hengyuan Zhang
Yatian Wang
J. Pan
Chen Liu
...
Qixun Zhang
Shanghang Zhang
Wenhan Luo
Qifeng Liu
Qi-fei Liu
DiffM
SLR
189
7
0
27 May 2024
Protect-Your-IP: Scalable Source-Tracing and Attribution against Personalized Generation
Runyi Li
Xuanyu Zhang
Zhipei Xu
Yongbing Zhang
Jian Zhang
WIGM
88
4
0
26 May 2024
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling
F. Babiloni
Alexandros Lattas
Jiankang Deng
Stefanos Zafeiriou
DiffM
100
4
0
26 May 2024
Underwater Image Enhancement by Diffusion Model with Customized CLIP-Classifier
Shuaixin Liu
Kunqian Li
Yilin Ding
Qi Qi
55
5
0
25 May 2024
C3LLM: Conditional Multimodal Content Generation Using Large Language Models
Zixuan Wang
Qinkai Duan
Yu-Wing Tai
Chi-Keung Tang
118
3
0
25 May 2024
Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation
Hongye Zeng
Ke Zou
Zhihao Chen
Ru Zheng
Huazhu Fu
MedIm
DiffM
90
7
0
25 May 2024
FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
Ke Fan
Junshu Tang
Weijian Cao
Ran Yi
Moran Li
Jing-yu Gong
Jiangning Zhang
Yabiao Wang
Chengjie Wang
Lizhuang Ma
116
19
0
24 May 2024
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
Yuchi Wang
Junliang Guo
Jianhong Bai
Runyi Yu
Tianyu He
Xu Tan
Xu Sun
Jiang Bian
DiffM
90
11
0
24 May 2024
Bridging The Gap between Low-rank and Orthogonal Adaptation via Householder Reflection Adaptation
Shen Yuan
Haotian Liu
Hongteng Xu
81
5
0
24 May 2024
Semantic Aware Diffusion Inverse Tone Mapping
Abhishek Goswami
Aru Ranjan Singh
Francesco Banterle
Kurt Debattista
Thomas Bashford-Rogers
DiffM
78
3
0
24 May 2024
SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance
Guibao Shen
Luozhou Wang
Jiantao Lin
Wenhang Ge
Chaozhe Zhang
...
Pengfei Wan
Zhong-ming Wang
Guangyong Chen
Yijun Li
Yingcong Chen
64
10
0
24 May 2024
Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion
Aoxue Li
Mingyang Yi
Zhenguo Li
DiffM
79
0
0
24 May 2024
StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models
Chengming Xu
Kai Hu
Donghao Luo
Jiangning Zhang
Wei Li
Yanhao Ge
Chengjie Wang
DiffM
73
0
0
24 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Haifeng Zhang
Mingsheng Long
VGen
114
40
0
24 May 2024
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu
Shiyu Li
Yuxuan Liu
Ping Huang
Jiulong Shan
Huimin Ma
Jian Yuan
84
6
0
24 May 2024
Learning Invariant Causal Mechanism from Vision-Language Models
Changwen Zheng
Siyu Zhao
Xingyu Zhang
Jiangmeng Li
Changwen Zheng
Jingyao Wang
CML
BDL
VLM
129
0
0
24 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
149
127
0
23 May 2024
Semantica: An Adaptable Image-Conditioned Diffusion Model
Manoj Kumar
N. Houlsby
Emiel Hoogeboom
DiffM
VLM
103
0
0
23 May 2024
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
Shuang Wu
Youtian Lin
Feihu Zhang
Yifei Zeng
Jingxi Xu
Philip Torr
Xun Cao
Yao Yao
110
63
0
23 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
86
16
0
23 May 2024
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control
Yong Zhong
Min Zhao
Zebin You
Xiaofeng Yu
Changwang Zhang
Chongxuan Li
DiffM
107
6
0
23 May 2024
Regressor-free Molecule Generation to Support Drug Response Prediction
Kun Li
Xiuwen Gong
Shirui Pan
Hongzhi Zhang
Bo Du
Wenbin Hu
70
1
0
23 May 2024
Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization
Zhibo Chen
Heming Sun
Li Zhang
Fan Zhang
110
3
0
23 May 2024
FreeTuner: Any Subject in Any Style with Training-free Diffusion
Youcan Xu
Zhen Wang
Jun Xiao
Wei Liu
Long Chen
DiffM
74
11
0
23 May 2024
Previous
1
2
3
...
31
32
33
...
60
61
62
Next