Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.12242
Cited By
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
50 / 2,074 papers shown
Title
Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)
Tsu-Ching Hsiao
Haoming Chen
Hsuan-Kung Yang
Chun-Yi Lee
DiffM
28
7
0
25 May 2023
Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models
Jooyoung Choi
Yunjey Choi
Yunji Kim
Junho Kim
Sung-Hoon Yoon
DiffM
38
52
0
25 May 2023
Unsupervised Semantic Correspondence Using Stable Diffusion
Eric Hedlin
Gopal Sharma
Shweta Mahajan
Hossam N. Isack
Abhishek Kar
Andrea Tagliasacchi
K. M. Yi
DiffM
52
86
0
24 May 2023
Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape
Rundi Wu
Ruoshi Liu
Carl Vondrick
Changxi Zheng
DiffM
32
24
0
24 May 2023
A Neural Space-Time Representation for Text-to-Image Personalization
Yuval Alaluf
Elad Richardson
G. Metzer
Daniel Cohen-Or
DiffM
41
94
0
24 May 2023
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
Yi Ma
Huan Yang
Wenhan Yang
Jianlong Fu
Jiaying Liu
DiffM
25
7
0
24 May 2023
Training on Thin Air: Improve Image Classification with Generated Data
Yongchao Zhou
Hshmat Sahak
Jimmy Ba
DiffM
24
43
0
24 May 2023
L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
Zheng Chang
Shuchen Weng
Pei Zhang
Yu Li
Si Li
Boxin Shi
DiffM
21
7
0
24 May 2023
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim
Junsoo Lee
Kibeom Hong
Daesik Kim
Namhyuk Ahn
DiffM
21
14
0
24 May 2023
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
Dongxu Li
Junnan Li
Steven C. H. Hoi
42
303
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
35
6
0
24 May 2023
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Y. Qu
Xinyue Shen
Xinlei He
Michael Backes
Savvas Zannettou
Yang Zhang
21
106
0
23 May 2023
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen
Yatai Ji
Jie Wu
Hefeng Wu
Pan Xie
Jiashi Li
Xin Xia
Xuefeng Xiao
Liang Lin
VGen
121
6
0
23 May 2023
DiffHand: End-to-End Hand Mesh Reconstruction via Diffusion Models
Lijun Li
Lian Zhuo
Bangze Zhang
Liefeng Bo
Chen Chen
45
5
0
23 May 2023
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
Yufan Zhou
Ruiyi Zhang
Tongfei Sun
Jinhui Xu
DiffM
109
38
0
23 May 2023
LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
Davide Morelli
Alberto Baldrati
Giuseppe Cartella
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
68
102
0
22 May 2023
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
44
320
0
22 May 2023
ControlVideo: Training-free Controllable Text-to-Video Generation
Yabo Zhang
Yuxiang Wei
Dongsheng Jiang
Xiaopeng Zhang
W. Zuo
Qi Tian
VGen
DiffM
48
237
0
22 May 2023
FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering
Megha Chakraborty
Khusbu Pahwa
Anku Rani
Shreyas Chatterjee
Dwip Dalal
...
Shreyash Mishra
K. Sensharma
Aman Chadha
Amit P. Sheth
Amitava Das
DiffM
35
7
0
22 May 2023
The CLIP Model is Secretly an Image-to-Prompt Converter
Yuxuan Ding
Chunna Tian
Haoxuan Ding
Lingqiao Liu
DiffM
22
14
0
22 May 2023
Mist: Towards Improved Adversarial Examples for Diffusion Models
Chumeng Liang
Xiaoyu Wu
DiffM
28
49
0
22 May 2023
Watermarking Diffusion Model
Yugeng Liu
Zheng Li
Michael Backes
Yun Shen
Yang Zhang
WIGM
38
34
0
21 May 2023
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions
Bosheng Qin
Juncheng Li
Siliang Tang
Tat-Seng Chua
Yueting Zhuang
VGen
DiffM
39
17
0
21 May 2023
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model
Chenjie Cao
Yunuo Cai
Qiaole Dong
Yikai Wang
Yanwei Fu
DiffM
40
15
0
19 May 2023
Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots
Jinyi Hu
Xu Han
Xiaoyuan Yi
Yutong Chen
Wenhao Li
Zhiyuan Liu
Maosong Sun
DiffM
33
4
0
19 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
41
45
0
18 May 2023
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Yujie Lu
Xianjun Yang
Xiujun Li
Xinze Wang
William Yang Wang
EGVM
57
73
0
18 May 2023
Inspecting the Geographical Representativeness of Images from Text-to-Image Models
Aparna Basu
R. Venkatesh Babu
Danish Pruthi
DiffM
41
39
0
18 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Sitong Su
Jianlong Fu
Jiaying Liu
DiffM
VGen
53
114
0
18 May 2023
TextDiffuser: Diffusion Models as Text Painters
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
66
114
0
18 May 2023
Constructing a personalized AI assistant for shear wall layout using Stable Diffusion
Lufeng Wang
Jie Liu
Guozhong Cheng
En Liu
Wei Chen
DiffM
11
2
0
18 May 2023
DiffUTE: Universal Text Editing Diffusion Model
Haoxing Chen
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Xing Zheng
Yaohui Li
Changhua Meng
Huijia Zhu
Weiqiang Wang
DiffM
38
34
0
18 May 2023
Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models
Yihao Huang
Felix Juefei Xu
Qing Guo
Jie M. Zhang
Yutong Wu
Ming Hu
Tianlin Li
Geguang Pu
Yang Liu
DiffM
21
32
0
18 May 2023
Content-based Unrestricted Adversarial Attack
Zhaoyu Chen
Bo Li
Shuang Wu
Kaixun Jiang
Shouhong Ding
Wenqiang Zhang
DiffM
34
62
0
18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGen
DiffM
61
239
0
17 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming Liu
Yogesh Balaji
DiffM
VGen
51
254
0
17 May 2023
Generating coherent comic with rich story using ChatGPT and Stable Diffusion
Ze Jin
Zorina Song
DiffM
17
16
0
16 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
46
33
0
15 May 2023
Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator
Jing Zhao
Heliang Zheng
Chaoyue Wang
Long Lan
Wanrong Huang
Wenjing Yang
DiffM
41
10
0
11 May 2023
Visual Tuning
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
62
38
0
10 May 2023
iEdit: Localised Text-guided Image Editing with Weak Supervision
Rumeysa Bodur
Erhan Gundogdu
Binod Bhattarai
Tae-Kyun Kim
M. Donoser
Loris Bazzani
DiffM
33
14
0
10 May 2023
Text-guided High-definition Consistency Texture Model
Zhibin Tang
Tiantong He
DiffM
23
6
0
10 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Liang Lin
37
40
0
09 May 2023
Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models
Wenkai Dong
Song Xue
Xiaoyue Duan
Shumin Han
DiffM
50
58
0
08 May 2023
Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning
Shengfang Zhai
Yinpeng Dong
Qingni Shen
Shih-Chieh Pu
Yuejian Fang
Hang Su
38
72
0
07 May 2023
AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion
Seungwoo Lee
Chaerin Kong
D. Jeon
Nojun Kwak
DiffM
26
19
0
06 May 2023
Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling Augmentation Framework
Ruijia Wu
Yuhang Wang
Huafeng Shi
Zhipeng Yu
Yichao Wu
Ding Liang
DiffM
29
9
0
06 May 2023
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
Hong Chen
Yipeng Zhang
Simin Wu
Xin Eric Wang
Xuguang Duan
Yuwei Zhou
Wenwu Zhu
DiffM
28
47
0
05 May 2023
Personalize Segment Anything Model with One Shot
Renrui Zhang
Zhengkai Jiang
Ziyu Guo
Shilin Yan
Junting Pan
Xianzheng Ma
Hao Dong
Peng Gao
Hongsheng Li
MLLM
VLM
41
208
0
04 May 2023
Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator
Chao Xu
Shaoting Zhu
Junwei Zhu
Alexander I. Rudnicky
Jiangning Zhang
Ying Tai
Yong Liu
DiffM
62
14
0
04 May 2023
Previous
1
2
3
...
36
37
38
...
40
41
42
Next