Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.12242
Cited By
v1
v2 (latest)
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
50 / 2,169 papers shown
Title
Gen2Det: Generate to Detect
Saksham Suri
Fanyi Xiao
Animesh Sinha
Sean Culatana
Raghuraman Krishnamoorthi
Chenchen Zhu
Abhinav Shrivastava
VLM
DiffM
93
10
0
07 Dec 2023
NeRFiller: Completing Scenes via Generative 3D Inpainting
Ethan Weber
Aleksander Holyñski
Varun Jampani
Saurabh Saxena
Noah Snavely
Abhishek Kar
Angjoo Kanazawa
106
32
0
07 Dec 2023
Generating Illustrated Instructions
Sachit Menon
Ishan Misra
Rohit Girdhar
DiffM
86
5
0
07 Dec 2023
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models
Ozgur Kara
Barışcan Kurtkaya
Hidir Yesiltepe
James M. Rehg
Pinar Yanardag
VGen
DiffM
102
55
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
64
43
0
07 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
141
201
0
07 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM
VGen
75
98
0
07 Dec 2023
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Jiayi Guo
Xingqian Xu
Yifan Pu
Zanlin Ni
Chaofei Wang
Manushree Vasu
Shiji Song
Gao Huang
Humphrey Shi
DiffM
76
32
0
07 Dec 2023
DemoCaricature: Democratising Caricature Generation with a Rough Sketch
Dar-Yen Chen
A. Bhunia
Subhadeep Koley
Aneeshan Sain
Pinaki Nath Chowdhury
Yi-Zhe Song
94
8
0
07 Dec 2023
Style Transfer to Calvin and Hobbes comics using Stable Diffusion
Sloke Shrestha
Sundar Sripada
Asvin Venkataramanan
DiffM
52
1
0
07 Dec 2023
Stable Diffusion for Data Augmentation in COCO and Weed Datasets
Boyang Deng
69
2
0
07 Dec 2023
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Bolin Lai
Xiaoliang Dai
Lawrence Chen
Guan Pang
James M. Rehg
Miao Liu
109
17
0
06 Dec 2023
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Zeyi Sun
Ye Fang
Tong Wu
Pan Zhang
Yuhang Zang
Shu Kong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
VLM
CLIP
131
91
0
06 Dec 2023
AVID: Any-Length Video Inpainting with Diffusion Model
Zhixing Zhang
Bichen Wu
Xiaoyan Wang
Yaqiao Luo
Luxin Zhang
Yinan Zhao
Peter Vajda
Dimitris N. Metaxas
Licheng Yu
VGen
DiffM
128
42
0
06 Dec 2023
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
105
23
0
06 Dec 2023
DiffusionSat: A Generative Foundation Model for Satellite Imagery
Samar Khanna
Patrick Liu
Linqi Zhou
Chenlin Meng
Robin Rombach
Marshall Burke
David B. Lobell
Stefano Ermon
89
65
0
06 Dec 2023
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting
Junhao Zhuang
Yanhong Zeng
Wenran Liu
Chun Yuan
Kai Chen
DiffM
141
79
0
06 Dec 2023
Language-Informed Visual Concept Learning
Sharon Lee
Yunzhi Zhang
Shangzhe Wu
Jiajun Wu
CoGe
72
9
0
06 Dec 2023
Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska
Animesh Sinha
Abhimanyu Dubey
Dhruv Mahajan
Vignesh Ramanathan
Filip Radenovic
DiffM
53
14
0
06 Dec 2023
Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention
Jianjin Xu
Saman Motamed
Praneetha Vaddamanu
C. Wu
Christian Haene
Jean-Charles Bazin
Fernando de la Torre
76
14
0
06 Dec 2023
Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Jingkuan Song
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
DiffM
72
3
0
06 Dec 2023
FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability
Linze Li
Sunqi Fan
Hengjun Pu
Z. Bing
Yao Tang
Tianzhu Ye
Tong Yang
Liangyu Chen
Jiajun Liang
VGen
DiffM
48
0
0
06 Dec 2023
Rethinking E-Commerce Search
Haixun Wang
Taesik Na
64
6
0
06 Dec 2023
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing
Shao-Yu Chang
Hwann-Tzong Chen
Tyng-Luh Liu
DiffM
VGen
102
3
0
05 Dec 2023
DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models
Shaoan Xie
Yang Zhao
Zhisheng Xiao
Kelvin C. K. Chan
Yandong Li
Yanwu Xu
Kun Zhang
Tingbo Hou
DiffM
98
28
0
05 Dec 2023
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet
Soon Yau Cheong
Armin Mustafa
Andrew Gilbert
DiffM
65
5
0
05 Dec 2023
LooseControl: Lifting ControlNet for Generalized Depth Conditioning
Shariq Farooq Bhat
Niloy J. Mitra
Peter Wonka
AI4CE
DiffM
78
40
0
05 Dec 2023
Alchemist: Parametric Control of Material Properties with Diffusion Models
Prafull Sharma
Varun Jampani
Yuanzhen Li
Xuhui Jia
Dmitry Lagun
Frédo Durand
William T. Freeman
Mark J. Matthews
DiffM
130
26
0
05 Dec 2023
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
Yuru Jia
Lukas Hoyer
Shengyu Huang
Tianfu Wang
Luc Van Gool
Konrad Schindler
Anton Obukhov
DiffM
123
24
0
05 Dec 2023
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
Jiachen Lu
Ze Huang
Zeyu Yang
Jiahui Zhang
Li Zhang
VGen
94
46
0
05 Dec 2023
LivePhoto: Real Image Animation with Text-guided Motion Control
Xi Chen
Zhiheng Liu
Mengting Chen
Yutong Feng
Yu Liu
Yujun Shen
Hengshuang Zhao
VGen
DiffM
85
33
0
05 Dec 2023
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
120
26
0
05 Dec 2023
Fine-grained Controllable Video Generation via Object Appearance and Context
Hsin-Ping Huang
Yu-Chuan Su
Deqing Sun
Lu Jiang
Xuhui Jia
Yukun Zhu
Ming-Hsuan Yang
DiffM
VGen
73
15
0
05 Dec 2023
Customization Assistant for Text-to-image Generation
Yufan Zhou
Ruiyi Zhang
Jiuxiang Gu
Tongfei Sun
DiffM
87
12
0
05 Dec 2023
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
Fengyuan Shi
Jiaxi Gu
Hang Xu
Songcen Xu
Wei Zhang
Limin Wang
VGen
DiffM
70
14
0
05 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
153
203
0
05 Dec 2023
FaceStudio: Put Your Face Everywhere in Seconds
Yuxuan Yan
C. Zhang
Rui Wang
Yichao Zhou
Gege Zhang
Pei Cheng
Gang Yu
Bin-Bin Fu
DiffM
70
41
0
05 Dec 2023
Stable Diffusion Exposed: Gender Bias from Prompt to Image
Yankun Wu
Yuta Nakashima
Noa Garcia
97
18
0
05 Dec 2023
Retrieving Conditions from Reference Images for Diffusion Models
Haoran Tang
Xin Zhou
Jieren Deng
Zhihong Pan
Hao Tian
Pratik Chaudhari
77
3
0
05 Dec 2023
SAVE: Protagonist Diversification with Structure Agnostic Video Editing
Yeji Song
Wonsik Shin
Junsoo Lee
Jeesoo Kim
Nojun Kwak
DiffM
VGen
157
6
0
05 Dec 2023
Orthogonal Adaptation for Modular Customization of Diffusion Models
Ryan Po
Guandao Yang
Kfir Aberman
Gordon Wetzstein
DiffM
87
30
0
05 Dec 2023
InstructBooth: Instruction-following Personalized Text-to-Image Generation
Daewon Chae
Nokyung Park
Jinkyu Kim
Kimin Lee
DiffM
38
11
0
04 Dec 2023
Readout Guidance: Learning Control from Diffusion Features
Grace Luo
Trevor Darrell
Oliver Wang
Dan B. Goldman
Aleksander Holynski
103
27
0
04 Dec 2023
Generative Powers of Ten
Xiaojuan Wang
Janne Kontkanen
Brian L. Curless
Steven M. Seitz
Ira Kemelmacher-Shlizerman
B. Mildenhall
Pratul P. Srinivasan
Dor Verbin
Aleksander Holynski
80
10
0
04 Dec 2023
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Bingxin Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM
MDE
142
173
0
04 Dec 2023
DiffiT: Diffusion Vision Transformers for Image Generation
Ali Hatamizadeh
Jiaming Song
Guilin Liu
Jan Kautz
Arash Vahdat
101
75
0
04 Dec 2023
Style Aligned Image Generation via Shared Attention
Amir Hertz
Andrey Voynov
Shlomi Fruchter
Daniel Cohen-Or
DiffM
68
135
0
04 Dec 2023
ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation
Dar-Yen Chen
Hamish Tennent
Ching-Wen Hsu
DiffM
114
27
0
04 Dec 2023
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Yuchao Gu
Yipin Zhou
Bichen Wu
Licheng Yu
Jia-Wei Liu
Rui Zhao
Jay Zhangjie Wu
David Junhao Zhang
Mike Zheng Shou
Kevin Tang
DiffM
VGen
125
42
0
04 Dec 2023
UniGS: Unified Representation for Image Generation and Segmentation
Lu Qi
Lehan Yang
Weidong Guo
Yu-Syuan Xu
Bo Du
Varun Jampani
Ming-Hsuan Yang
96
19
0
04 Dec 2023
Previous
1
2
3
...
29
30
31
...
42
43
44
Next