ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.12242
  4. Cited By
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
v1v2 (latest)

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
ArXiv (abs)PDFHTML

Papers citing "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"

50 / 2,167 papers shown
Title
Unsupervised Semantic Correspondence Using Stable Diffusion
Unsupervised Semantic Correspondence Using Stable Diffusion
Eric Hedlin
Gopal Sharma
Shweta Mahajan
Hossam N. Isack
Abhishek Kar
Andrea Tagliasacchi
K. M. Yi
DiffM
106
96
0
24 May 2023
Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape
Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape
Rundi Wu
Ruoshi Liu
Carl Vondrick
Changxi Zheng
DiffM
124
25
0
24 May 2023
A Neural Space-Time Representation for Text-to-Image Personalization
A Neural Space-Time Representation for Text-to-Image Personalization
Yuval Alaluf
Elad Richardson
G. Metzer
Daniel Cohen-Or
DiffM
104
100
0
24 May 2023
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image
  Super-Resolution
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
Yi Ma
Huan Yang
Wenhan Yang
Jianlong Fu
Jiaying Liu
DiffM
82
7
0
24 May 2023
Training on Thin Air: Improve Image Classification with Generated Data
Training on Thin Air: Improve Image Classification with Generated Data
Yongchao Zhou
Hshmat Sahak
Jimmy Ba
DiffM
85
47
0
24 May 2023
L-CAD: Language-based Colorization with Any-level Descriptions using
  Diffusion Priors
L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
Zheng Chang
Shuchen Weng
Pei Zhang
Yu Li
Si Li
Boxin Shi
DiffM
67
7
0
24 May 2023
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion
  Models
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim
Junsoo Lee
Kibeom Hong
Daesik Kim
Namhyuk Ahn
DiffM
88
15
0
24 May 2023
BLIP-Diffusion: Pre-trained Subject Representation for Controllable
  Text-to-Image Generation and Editing
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
Dongxu Li
Junnan Li
Steven C. H. Hoi
105
331
0
24 May 2023
Vision + Language Applications: A Survey
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
117
7
0
24 May 2023
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes
  From Text-To-Image Models
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Y. Qu
Xinyue Shen
Xinlei He
Michael Backes
Savvas Zannettou
Yang Zhang
71
124
0
23 May 2023
Control-A-Video: Controllable Text-to-Video Generation with Diffusion
  Models
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen
Yatai Ji
Jie Wu
Hefeng Wu
Pan Xie
Jiashi Li
Xin Xia
Xuefeng Xiao
Liang Lin
VGen
208
11
0
23 May 2023
DiffHand: End-to-End Hand Mesh Reconstruction via Diffusion Models
DiffHand: End-to-End Hand Mesh Reconstruction via Diffusion Models
Lijun Li
Lian Zhuo
Bangze Zhang
Liefeng Bo
Chen Chen
97
5
0
23 May 2023
Enhancing Detail Preservation for Customized Text-to-Image Generation: A
  Regularization-Free Approach
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
Yufan Zhou
Ruiyi Zhang
Tongfei Sun
Jinhui Xu
DiffM
144
40
0
23 May 2023
LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
Davide Morelli
Alberto Baldrati
Giuseppe Cartella
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
151
116
0
22 May 2023
Training Diffusion Models with Reinforcement Learning
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
154
379
0
22 May 2023
ControlVideo: Training-free Controllable Text-to-Video Generation
ControlVideo: Training-free Controllable Text-to-Video Generation
Yabo Zhang
Yuxiang Wei
Dongsheng Jiang
Xiaopeng Zhang
W. Zuo
Qi Tian
VGenDiffM
124
254
0
22 May 2023
FACTIFY3M: A Benchmark for Multimodal Fact Verification with
  Explainability through 5W Question-Answering
FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering
Megha Chakraborty
Khusbu Pahwa
Anku Rani
Shreyas Chatterjee
Dwip Dalal
...
Shreyash Mishra
K. Sensharma
Aman Chadha
Amit P. Sheth
Amitava Das
DiffM
74
8
0
22 May 2023
The CLIP Model is Secretly an Image-to-Prompt Converter
The CLIP Model is Secretly an Image-to-Prompt Converter
Yuxuan Ding
Chunna Tian
Haoxuan Ding
Lingqiao Liu
DiffM
59
15
0
22 May 2023
Mist: Towards Improved Adversarial Examples for Diffusion Models
Mist: Towards Improved Adversarial Examples for Diffusion Models
Chumeng Liang
Xiaoyu Wu
DiffM
90
57
0
22 May 2023
Watermarking Diffusion Model
Watermarking Diffusion Model
Yugeng Liu
Zheng Li
Michael Backes
Yun Shen
Yang Zhang
WIGM
84
36
0
21 May 2023
InstructVid2Vid: Controllable Video Editing with Natural Language
  Instructions
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions
Bosheng Qin
Juncheng Li
Siliang Tang
Tat-Seng Chua
Yueting Zhuang
VGenDiffM
71
17
0
21 May 2023
LeftRefill: Filling Right Canvas based on Left Reference through
  Generalized Text-to-Image Diffusion Model
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model
Chenjie Cao
Yunuo Cai
Qiaole Dong
Yikai Wang
Yanwei Fu
DiffM
100
15
0
19 May 2023
Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with
  Images as Pivots
Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots
Jinyi Hu
Xu Han
Xiaoyuan Yi
Yutong Chen
Wenhao Li
Zhiyuan Liu
Maosong Sun
DiffM
37
4
0
19 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffMOCL
124
47
0
18 May 2023
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image
  Synthesis Evaluation
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Yujie Lu
Xianjun Yang
Xiujun Li
Xinze Wang
William Yang Wang
EGVM
143
79
0
18 May 2023
Inspecting the Geographical Representativeness of Images from
  Text-to-Image Models
Inspecting the Geographical Representativeness of Images from Text-to-Image Models
Aparna Basu
R. Venkatesh Babu
Danish Pruthi
DiffM
120
40
0
18 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Sitong Su
Jianlong Fu
Jiaying Liu
DiffMVGen
155
117
0
18 May 2023
TextDiffuser: Diffusion Models as Text Painters
TextDiffuser: Diffusion Models as Text Painters
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
193
126
0
18 May 2023
Constructing a personalized AI assistant for shear wall layout using
  Stable Diffusion
Constructing a personalized AI assistant for shear wall layout using Stable Diffusion
Lufeng Wang
Jie Liu
Guozhong Cheng
En Liu
Wei Chen
DiffM
25
2
0
18 May 2023
DiffUTE: Universal Text Editing Diffusion Model
DiffUTE: Universal Text Editing Diffusion Model
Haoxing Chen
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Xing Zheng
Yaohui Li
Changhua Meng
Huijia Zhu
Weiqiang Wang
DiffM
102
35
0
18 May 2023
Personalization as a Shortcut for Few-Shot Backdoor Attack against
  Text-to-Image Diffusion Models
Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models
Yihao Huang
Felix Juefei Xu
Qing Guo
Jie M. Zhang
Yutong Wu
Ming Hu
Tianlin Li
Geguang Pu
Yang Liu
DiffM
117
31
0
18 May 2023
Content-based Unrestricted Adversarial Attack
Content-based Unrestricted Adversarial Attack
Zhaoyu Chen
Yue Liu
Shuang Wu
Kaixun Jiang
Shouhong Ding
Wenqiang Zhang
DiffM
91
70
0
18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized
  Attention
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGenDiffM
152
254
0
17 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming-Yuan Liu
Yogesh Balaji
DiffMVGen
125
263
0
17 May 2023
Generating coherent comic with rich story using ChatGPT and Stable
  Diffusion
Generating coherent comic with rich story using ChatGPT and Stable Diffusion
Ze Jin
Zorina Song
DiffM
41
16
0
16 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffMVGen
102
34
0
15 May 2023
Null-text Guidance in Diffusion Models is Secretly a Cartoon-style
  Creator
Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator
Jing Zhao
Heliang Zheng
Chaoyue Wang
Long Lan
Wanrong Huang
Wenjing Yang
DiffM
113
10
0
11 May 2023
Visual Tuning
Visual Tuning
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
174
41
0
10 May 2023
iEdit: Localised Text-guided Image Editing with Weak Supervision
iEdit: Localised Text-guided Image Editing with Weak Supervision
Rumeysa Bodur
Erhan Gundogdu
Binod Bhattarai
Tae-Kyun Kim
M. Donoser
Loris Bazzani
DiffM
72
15
0
10 May 2023
Text-guided High-definition Consistency Texture Model
Text-guided High-definition Consistency Texture Model
Zhibin Tang
Tiantong He
DiffM
37
6
0
10 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with
  Large Language Models
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Liang Lin
94
41
0
09 May 2023
Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion
  Models
Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models
Wenkai Dong
Song Xue
Xiaoyue Duan
Shumin Han
DiffM
95
62
0
08 May 2023
Text-to-Image Diffusion Models can be Easily Backdoored through
  Multimodal Data Poisoning
Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning
Shengfang Zhai
Yinpeng Dong
Qingni Shen
Shih-Chieh Pu
Yuejian Fang
Hang Su
73
77
0
07 May 2023
AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion
AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion
Seungwoo Lee
Chaerin Kong
D. Jeon
Nojun Kwak
DiffM
111
20
0
06 May 2023
Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling
  Augmentation Framework
Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling Augmentation Framework
Ruijia Wu
Yuhang Wang
Huafeng Shi
Zhipeng Yu
Yichao Wu
Ding Liang
DiffM
67
9
0
06 May 2023
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven
  Text-to-Image Generation
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
Hong Chen
Yipeng Zhang
Simin Wu
Xin Eric Wang
Xuguang Duan
Yuwei Zhou
Wenwu Zhu
DiffM
110
51
0
05 May 2023
Personalize Segment Anything Model with One Shot
Personalize Segment Anything Model with One Shot
Renrui Zhang
Zhengkai Jiang
Ziyu Guo
Shilin Yan
Junting Pan
Xianzheng Ma
Hao Dong
Peng Gao
Hongsheng Li
MLLMVLM
111
219
0
04 May 2023
Multimodal-driven Talking Face Generation via a Unified Diffusion-based
  Generator
Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator
Chao Xu
Shaoting Zhu
Junwei Zhu
Alexander I. Rudnicky
Jiangning Zhang
Ying Tai
Yong Liu
DiffM
117
14
0
04 May 2023
Few-shot Domain-Adaptive Visually-fused Event Detection from Text
Few-shot Domain-Adaptive Visually-fused Event Detection from Text
Farhad Moghimifar
Fatemeh Shiri
Van Nguyen
Gholamreza Haffari
Yuanyou Li
VLM
72
2
0
04 May 2023
Key-Locked Rank One Editing for Text-to-Image Personalization
Key-Locked Rank One Editing for Text-to-Image Personalization
Yoad Tewel
Rinon Gal
Gal Chechik
Yuval Atzmon
DiffM
252
174
0
02 May 2023
Previous
123...383940...424344
Next