arXiv: 2208.12242
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Papers citing "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
50 / 2,074 papers shown
Title
Control3D: Towards Controllable Text-to-3D Generation
Yang Chen
Yingwei Pan
Yehao Li
Ting Yao
Tao Mei
DiffM
33
46
0
09 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
45
13
0
07 Nov 2023
Reducing Spatial Fitting Error in Distillation of Denoising Diffusion Models
Shengzhe Zhou
Zejian Lee
Sheng Zhang
Lefan Hou
Changyuan Yang
Guang Yang
Zhiyuan Yang
Lingyun Sun
DiffM
46
0
0
07 Nov 2023
Stable Diffusion Reference Only: Image Prompt and Blueprint Jointly Guided Multi-Condition Diffusion Model for Secondary Painting
Hao Ai
Lu Sheng
DiffM
18
3
0
04 Nov 2023
Quantum circuit synthesis with diffusion models
Florian Fürrutter
Gorka Muñoz-Gil
Hans J. Briegel
AI4CE
DiffM
34
20
0
03 Nov 2023
Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields
Jianxiong Shen
Ruijie Ren
Adria Ruiz
Francesc Moreno-Noguer
45
10
0
03 Nov 2023
The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing
Shen Nie
Hanzhong Guo
Cheng Lu
Yuhao Zhou
Chenyu Zheng
Chongxuan Li
DiffM
34
38
0
02 Nov 2023
DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning
Wenxuan Bao
Francesco Pittaluga
Vijay Kumar
Vincent Bindschaedler
33
9
0
02 Nov 2023
VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning on Language-Video Foundation Models
Hong Chen
Xin Wang
Guanning Zeng
Yipeng Zhang
Yuwei Zhou
Feilin Han
Wenwu Zhu
VGen
DiffM
38
1
0
02 Nov 2023
On Manipulating Scene Text in the Wild with Diffusion Models
Joshua Santoso
Christian Simon
Williem Pao
DiffM
48
6
0
01 Nov 2023
Consistent Video-to-Video Transfer Using Synthetic Dataset
Jiaxin Cheng
Tianjun Xiao
Tong He
VGen
DiffM
44
14
0
01 Nov 2023
Diversity and Diffusion: Observations on Synthetic Image Distributions with Stable Diffusion
David Marwood
S. Baluja
Y. Alon
DiffM
67
5
0
31 Oct 2023
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone
Zeyinzi Jiang
Chaojie Mao
Ziyuan Huang
Ao Ma
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
43
15
0
30 Oct 2023
CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models
Ziyang Yuan
Mingdeng Cao
Xintao Wang
Zhongang Qi
Chun Yuan
Ying Shan
DiffM
25
23
0
30 Oct 2023
Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models
Hai Wang
Xiaoyu Xiang
Yuchen Fan
Jing-Hao Xue
96
26
0
28 Oct 2023
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li
Jingyi Lu
Kai Han
V. Prisacariu
DiffM
32
19
0
26 Oct 2023
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
You-Ming Chang
Chen Yeh
Wei-Chen Chiu
Ning Yu
VPVLM
VLM
83
23
0
26 Oct 2023
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Jingxiang Sun
Bo Zhang
Ruizhi Shao
Lizhen Wang
Wen Liu
Zhenda Xie
Yebin Liu
55
132
0
25 Oct 2023
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models
Tianyi Lu
Xing Zhang
Jiaxi Gu
Hang Xu
Renjing Pei
Songcen Xu
Zuxuan Wu
DiffM
VGen
35
4
0
25 Oct 2023
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Kai Li
Yupeng Deng
Yun-long Kong
Diyou Liu
Jingbo Chen
Yu Meng
Junxian Ma
Chenhao Wang
53
1
0
25 Oct 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
59
8
0
25 Oct 2023
iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis
Yash Kant
Aliaksandr Siarohin
Michael Vasilkovsky
R. A. Guler
Jian Ren
Sergey Tulyakov
Igor Gilitschenski
DiffM
41
12
0
24 Oct 2023
CVPR 2023 Text Guided Video Editing Competition
Jay Zhangjie Wu
Xiuyu Li
Difei Gao
Zhen Dong
Jinbin Bai
...
Xu Cheng
Jie Tang
Mike Zheng Shou
Kurt Keutzer
Forrest N. Iandola
38
34
0
24 Oct 2023
Integrating View Conditions for Image Synthesis
Jinbin Bai
Zhen Dong
Aosong Feng
Xiao Zhang
Tian-Chun Ye
Kaicheng Zhou
67
13
0
24 Oct 2023
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Shawn Shan
Wenxin Ding
Josephine Passananti
Stanley Wu
Haitao Zheng
Ben Y. Zhao
SILM
DiffM
38
45
0
20 Oct 2023
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models
Tianshi Cao
Karsten Kreis
Sanja Fidler
Nicholas Sharp
Kangxue Yin
38
74
0
20 Oct 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
Zhongzhan Huang
Pan Zhou
Shuicheng Yan
Liang Lin
29
27
0
20 Oct 2023
EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model
Zheyu Zhang
Lanhong Yao
Bin Wang
Debesh Jha
Elif Keles
Alpay Medetalibeyoglu
Ulas Bagci
MedIm
46
10
0
19 Oct 2023
An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning
Chen Jin
Ryutaro Tanno
Amrutha Saseendran
Tom Diethe
Philip Teare
27
2
0
18 Oct 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
89
52
0
16 Oct 2023
BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Yu Gu
Jianwei Yang
Naoto Usuyama
Chun-yue Li
Sheng Zhang
M. Lungren
Jianfeng Gao
Hoifung Poon
MedIm
35
22
0
16 Oct 2023
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
64
119
0
16 Oct 2023
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
Hanan Gani
Shariq Farooq Bhat
Muzammal Naseer
Salman Khan
Peter Wonka
DiffM
49
39
0
16 Oct 2023
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
Jia-Wei Liu
Yan-Pei Cao
Jay Zhangjie Wu
Weijia Mao
Yuchao Gu
Rui Zhao
Jussi Keppo
Ying Shan
Mike Zheng Shou
VGen
DiffM
50
15
0
16 Oct 2023
ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
Jiayu Yang
Ziang Cheng
Yunfei Duan
Pan Ji
Hongdong Li
DiffM
52
54
0
16 Oct 2023
EasyGen: Easing Multimodal Generation with BiDiffuser and LLMs
Xiangyu Zhao
Bo Liu
Qijiong Liu
Guangyuan Shi
Xiao-Ming Wu
VLM
DiffM
29
7
0
13 Oct 2023
R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation
Jiayu Xiao
Henglei Lv
Liang Li
Shuhui Wang
Qingming Huang
DiffM
45
20
0
13 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
13
22
0
12 Oct 2023
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao
Yuchao Gu
Jay Zhangjie Wu
David Junhao Zhang
Jia-Wei Liu
Weijia Wu
Jussi Keppo
Mike Zheng Shou
DiffM
VGen
48
106
0
12 Oct 2023
Debias the Training of Diffusion Models
Huikang Yu
Li Shen
Jie Huang
Man Zhou
Hongsheng Li
Feng Zhao
DiffM
32
3
0
12 Oct 2023
Mapping Memes to Words for Multimodal Hateful Meme Classification
Giovanni Burbi
Alberto Baldrati
Lorenzo Agnolucci
Marco Bertini
A. Bimbo
27
12
0
12 Oct 2023
Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting
Zijie Chen
Lichao Zhang
Fangsheng Weng
Lili Pan
Zhenzhong Lan
32
9
0
12 Oct 2023
SingleInsert: Inserting New Concepts from a Single Image into Text-to-Image Models for Flexible Editing
Zijie Wu
Chaohui Yu
Zhen Zhu
Fan Wang
Xiang Bai
22
12
0
12 Oct 2023
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Zeqiang Lai
Xizhou Zhu
Jifeng Dai
Yu Qiao
Wenhai Wang
MLLM
DiffM
54
23
0
11 Oct 2023
Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else
Hazarapet Tunanyan
Dejia Xu
Shant Navasardyan
Zhangyang Wang
Humphrey Shi
DiffM
88
7
0
11 Oct 2023
Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model
Shiyuan Yang
Xiaodong Chen
Jing Liao
DiffM
30
59
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
38
104
0
11 Oct 2023
ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning
Alec Helbling
Evan Montoya
Duen Horng Chau
DiffM
39
1
0
10 Oct 2023
Mitigating stereotypical biases in text to image generative systems
Piero Esposito
Parmida Atighehchian
Anastasis Germanidis
Deepti Ghadiyaram
33
16
0
10 Oct 2023
JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling
Jingyang Zhang
Shiwei Li
Yuanxun Lu
Tian Fang
David McKinnon
Yanghai Tsin
Long Quan
Yao Yao
27
10
0
10 Oct 2023