2208.12242
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Papers citing "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
50 / 2,169 papers shown
Title
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
Susan Liang
Chao Huang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
117
8
0
09 Oct 2024
Personalized Visual Instruction Tuning
Renjie Pi
Jianshu Zhang
Tianyang Han
Jipeng Zhang
Boyao Wang
Tong Zhang
MLLM
81
9
0
09 Oct 2024
Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques
Benyuan Meng
Qianqian Xu
Zitai Wang
Zhiyong Yang
Xiaochun Cao
Qingming Huang
96
0
0
09 Oct 2024
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization
Jiawei Mao
Xiaoke Huang
Yunfei Xie
Yuanqi Chang
Mude Hui
Bingjie Xu
Yuyin Zhou
VGen
DiffM
119
4
0
08 Oct 2024
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM
Stefan Stefanache
Lluís Pastor Pérez
Julen Costa Watanabe
Ernesto Sanchez Tejedor
Thomas Hofmann
Enis Simsar
EGVM
38
0
0
08 Oct 2024
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
June Suk Choi
Kyungmin Lee
Jongheon Jeong
Saining Xie
Jinwoo Shin
Kimin Lee
DiffM
AAML
65
4
0
08 Oct 2024
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation
Gihyun Kwon
Jong Chul Ye
DiffM
130
5
0
08 Oct 2024
Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors
Ziwei Liao
Binbin Xu
Steven L. Waslander
DiffM
112
3
0
07 Oct 2024
GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
Yukang Cao
Masoud Hadi
Liang Pan
Ziwei Liu
3DGS
DiffM
102
5
0
07 Oct 2024
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction
Leheng Li
Weichao Qiu
Xu Yan
Jing He
Kaiqiang Zhou
Yingjie Cai
Qing Lian
Bingbing Liu
Ying-Cong Chen
SyDa
DiffM
85
1
0
07 Oct 2024
Low-Rank Continual Personalization of Diffusion Models
Łukasz Staniszewski
Katarzyna Zaleska
Kamil Deja
DiffM
101
0
0
07 Oct 2024
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
Ayano Hiranaka
Shang-Fu Chen
Chieh-Hsin Lai
Dongjun Kim
Naoki Murata
Takashi Shibuya
Wei-Hsiang Liao
Shao-Hua Sun
Yuki Mitsufuji
123
2
0
07 Oct 2024
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Yepeng Liu
Yiren Song
Hai Ci
Yu Zhang
Haofan Wang
Mike Zheng Shou
Yuheng Bu
WIGM
118
7
0
07 Oct 2024
Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models
Theo Putterman
Derek Lim
Yoav Gelberg
Stefanie Jegelka
Haggai Maron
AI4CE
102
6
0
05 Oct 2024
Text-guided Diffusion Model for 3D Molecule Generation
Yanchen Luo
Sihang Li
Changhao Nai
Zhiyuan Liu
Jiancan Wu
An Zhang
Wenjie Du
Xiang Wang
81
7
0
04 Oct 2024
Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
Zichen Miao
Zhengyuan Yang
Kevin Lin
Ze Wang
Zicheng Liu
Lijuan Wang
Qiang Qiu
97
6
0
04 Oct 2024
Event-Customized Image Generation
Zhen Wang
Yilei Jiang
Dong Zheng
Jun Xiao
Long Chen
DiffM
53
1
0
03 Oct 2024
Leveraging Model Guidance to Extract Training Data from Personalized Diffusion Models
Xiaoyu Wu
Jiaru Zhang
Steven Wu
123
2
0
03 Oct 2024
ControlAR: Controllable Image Generation with Autoregressive Models
Zongming Li
Tianheng Cheng
Shoufa Chen
Peize Sun
Haocheng Shen
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
DiffM
248
19
0
03 Oct 2024
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation
Jing He
Haodong Li
Yongzhe Hu
Guibao Shen
Yingjie Cai
Weichao Qiu
Ying-Cong Chen
DiffM
97
4
0
02 Oct 2024
Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer
Kento Masui
Mayu Otani
Masahiro Nomura
Hideki Nakayama
DiffM
53
1
0
02 Oct 2024
Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Minoh Jeong
Min Namgung
Zae Myung Kim
Dongyeop Kang
Yao-Yi Chiang
Alfred Hero
126
0
0
02 Oct 2024
Improving Fine-Grained Control via Aggregation of Multiple Diffusion Models
Conghan Yue
Zhengwei Peng
Shiyan Du
Zhi Ji
Chuangjian Cai
Le Wan
Dongyu Zhang
68
0
0
02 Oct 2024
Improved Generation of Synthetic Imaging Data Using Feature-Aligned Diffusion
Lakshmi Nair
MedIm
26
0
0
01 Oct 2024
MCGM: Mask Conditional Text-to-Image Generative Model
Rami Skaik
Leonardo Rossi
Tomaso Fontanini
Andrea Prati
DiffM
40
0
0
01 Oct 2024
CusConcept: Customized Visual Concept Decomposition with Diffusion Models
Zhi Xu
Shaozhe Hao
Kai Han
DiffM
74
4
0
01 Oct 2024
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Saurav Jha
Shiqi Yang
Masato Ishii
Mengjie Zhao
Christian Simon
Muhammad Jehanzeb Mirza
Dong Gong
Lina Yao
Shusuke Takahashi
Yuki Mitsufuji
DiffM
151
3
0
01 Oct 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
Zhen Han
Zeyinzi Jiang
Yulin Pan
Jingfeng Zhang
Chaojie Mao
Chenwei Xie
Yu Liu
Jingren Zhou
DiffM
96
21
0
30 Sep 2024
Training a Computer Vision Model for Commercial Bakeries with Primarily Synthetic Images
Thomas H. Schmitt
Maximilian Bundscherer
Tobias Bocklet
31
0
0
30 Sep 2024
Illustrious: an Open Advanced Illustration Model
Sang Hyun Park
Jun Young Koh
Junha Lee
Joy Song
Dongha Kim
Hoyeon Moon
Hyunju Lee
Min Song
VLM
51
1
0
30 Sep 2024
Simple and Fast Distillation of Diffusion Models
Zhenyu Zhou
Defang Chen
Can Wang
Chun Chen
Siwei Lyu
DiffM
75
8
0
29 Sep 2024
Conditional Image Synthesis with Diffusion Models: A Survey
Zheyuan Zhan
Defang Chen
Jian-Ping Mei
Zhenghe Zhao
Jiawei Chen
Chun-Yen Chen
Siwei Lyu
Can Wang
VLM
109
10
0
28 Sep 2024
Fusion is all you need: Face Fusion for Customized Identity-Preserving Image Synthesis
Salaheldin Mohamed
Dong Han
Yong Li
57
1
0
27 Sep 2024
Gradient-free Decoder Inversion in Latent Diffusion Models
Seongmin Hong
Suh Yoon Jeon
Kyeonghyun Lee
Ernest K. Ryu
S. Chun
79
0
0
27 Sep 2024
Amodal Instance Segmentation with Diffusion Shape Prior Estimation
Minh Tran
Khoa T. Vo
Tri Nguyen
Ngan Le
DiffM
69
0
0
26 Sep 2024
Stable Video Portraits
Mirela Ostrek
Justus Thies
VGen
DiffM
72
1
0
26 Sep 2024
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction
Runze He
Kai Ma
Linjiang Huang
Shaofei Huang
Jialin Gao
Xiaoming Wei
Jiao Dai
Jizhong Han
Si Liu
DiffM
78
9
0
26 Sep 2024
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Qihan Huang
Siming Fu
Jinlong Liu
Hao Jiang
Yipeng Yu
Jie Song
71
9
0
26 Sep 2024
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
Jinghao Zhang
Wen Qian
Hao Luo
Fan Wang
Feng Zhao
DiffM
71
0
0
26 Sep 2024
Dark Miner: Defend against unsafe generation for text-to-image diffusion models
Zheling Meng
Bo Peng
Xiaochuan Jin
Yue Jiang
Jing Dong
Wei Wang
Tieniu Tan
DiffM
74
2
0
26 Sep 2024
ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis
Fangshuo Zhou
Huaxia Li
Rui Hu
Sensen Wu
Hailin Feng
Zhenhong Du
Liuchang Xu
DiffM
68
2
0
25 Sep 2024
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Kyuheon Jung
Yongdeuk Seo
Seongwoo Cho
Jaeyoung Kim
Hyun-seok Min
Sungchul Choi
33
1
0
25 Sep 2024
Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models
Deepak Sridhar
Nuno Vasconcelos
DiffM
64
2
0
25 Sep 2024
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
Phillip Mueller
Sebastian Mueller
Lars Mikelsons
120
2
0
25 Sep 2024
NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers
Nohil Park
Heeseung Kim
Che Hyun Lee
Jooyoung Choi
Jiheum Yeom
Sungroh Yoon
72
2
0
24 Sep 2024
ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
D. Kothandaraman
Kuldeep Kulkarni
Sumit Shekhar
Balaji Vasan Srinivasan
Dinesh Manocha
DiffM
95
1
0
24 Sep 2024
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond
Hong Chen
Xin Wang
Yuwei Zhou
Bin Huang
Yipeng Zhang
Wei Feng
Houlun Chen
Zeyang Zhang
Siao Tang
Wenwu Zhu
DiffM
139
9
0
23 Sep 2024
VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models
Jingtao Cao
Zheng Zhang
Hongru Wang
Kam-Fai Wong
59
0
0
23 Sep 2024
Fine Tuning Text-to-Image Diffusion Models for Correcting Anomalous Images
Hyunwoo Yoo
31
2
0
23 Sep 2024
Dormant: Defending against Pose-driven Human Image Animation
Jiachen Zhou
Mingsi Wang
Tianlin Li
Guozhu Meng
Kai Chen
160
5
0
22 Sep 2024