Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.12242
Cited By
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
50 / 556 papers shown
Title
Style Customization of Text-to-Vector Generation with Image Diffusion Priors
Peiying Zhang
Nanxuan Zhao
Jing Liao
DiffM
37
0
0
15 May 2025
Visual Watermarking in the Era of Diffusion Models: Advances and Challenges
Junxian Duan
Jiyang Guan
Wenkui Yang
Ran He
WIGM
43
0
0
13 May 2025
Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Anle Ke
Xu Zhang
Tong Chen
Ming-Tse Lu
Chao Zhou
Jiawen Gu
Zhan Ma
DiffM
30
0
0
13 May 2025
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
Donghoon Kim
Minji Bae
Kyuhong Shim
B. Shim
38
0
0
13 May 2025
FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images
Raul Ismayilov
Dzemila Sero
Luuk Spreeuwers
29
0
0
12 May 2025
Uni-AIMS: AI-Powered Microscopy Image Analysis
Yanhui Hong
Nan Wang
Zhiyi Xia
Haoyi Tao
Xi Fang
...
Shengyu Li
Ziqi Chen
Zezhong Zhang
Guolin Ke
Linfeng Zhang
26
0
0
11 May 2025
BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation
Panwen Hu
Jiehui Huang
Qiang Sun
Xiaodan Liang
DiffM
VGen
28
0
0
11 May 2025
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Kunpeng Qiu
Zhiqiang Gao
Zhiying Zhou
Mingjie Sun
Yongxin Guo
MedIm
34
0
0
09 May 2025
Demystifying Diffusion Policies: Action Memorization and Simple Lookup Table Alternatives
Chengyang He
Xu Liu
Gadiel Sznaier Camps
Guillaume Sartoretti
Mac Schwager
28
0
0
09 May 2025
Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation
Dongying Li
Binyi Su
Hua Zhang
Yong Li
Haiyong Chen
140
0
0
09 May 2025
Diffusion Model Quantization: A Review
Qian Zeng
Chenggong Hu
Mingli Song
Jie Song
MQ
45
0
0
08 May 2025
PIDiff: Image Customization for Personalized Identities with Diffusion Models
Jinyu Gu
Haipeng Liu
M. Y. Wang
Y. Wang
68
0
0
08 May 2025
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Teng Hu
Zhentao Yu
Zhengguang Zhou
Sen Liang
Yuan Zhou
Qin Lin
Qinglin Lu
DiffM
VGen
57
0
0
07 May 2025
Generating Synthetic Data via Augmentations for Improved Facial Resemblance in DreamBooth and InstantID
Koray Ulusan
Benjamin Kiefer
DiffM
45
0
0
06 May 2025
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
L. Wang
Senmao Li
Fei Yang
Jianye Wang
Ziheng Zhang
Yong-Jin Liu
Y. Wang
Jian Yang
DiffM
61
0
0
06 May 2025
Deepfakes on Demand: the rise of accessible non-consensual deepfake image generators
Will Hawkins
Chris Russell
Brent Mittelstadt
DiffM
132
0
0
06 May 2025
Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models
Kuofeng Gao
Yufei Zhu
Yiming Li
Jiawang Bai
Yong-Liang Yang
Z. Li
Shu-Tao Xia
41
0
0
05 May 2025
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Zinan Guo
Pengze Zhang
Yanze Wu
Chong Mou
Mingcong Liu
Qian He
33
0
0
05 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Xuzhi Zhang
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
74
0
0
05 May 2025
Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset
Jakub Wąsala
Bartłomiej Wrzalski
Kornelia Noculak
Yuliia Tarasenko
Oliwer Krupa
Jan Kocoń
Grzegorz Chodak
33
0
0
04 May 2025
DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization
Wenchuan Wang
Mengqi Huang
Yijing Tu
Zhendong Mao
VGen
69
0
0
04 May 2025
RAGAR: Retrieval Augment Personalized Image Generation Guided by Recommendation
Run Ling
Luu Anh Tuan
Yuting Liu
G. Guo
Linying Jiang
Xingwei Wang
DiffM
57
0
0
03 May 2025
PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach
Nitin Rai
Arnold W. Schumann
Nathan Boyd
MedIm
39
0
0
03 May 2025
VSC: Visual Search Compositional Text-to-Image Diffusion Model
Do Huu Dat
Nam Hyeonu
Po Yuan Mao
Tae-Hyun Oh
DiffM
CoGe
66
0
0
02 May 2025
Multi-Modal Language Models as Text-to-Image Model Evaluators
Jiahui Chen
Candace Ross
Reyhane Askari Hemmat
Koustuv Sinha
Melissa Hall
M. Drozdzal
Adriana Romero-Soriano
EGVM
60
0
0
01 May 2025
Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution
Luigi Sigillo
Christian Bianchi
A. Uncini
Danilo Comminiello
48
0
0
01 May 2025
Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision
Weicai Yan
Wang Lin
Zirun Guo
Ye Wang
Fangming Feng
Xiaoda Yang
Zhilin Wang
Tao Jin
DiffM
132
2
0
30 Apr 2025
YoChameleon: Personalized Vision and Language Generation
Thao Nguyen
Krishna Kumar Singh
Jing Shi
Trung H. Bui
Yong Jae Lee
Yuheng Li
MLLM
82
0
0
29 Apr 2025
EarthMapper: Visual Autoregressive Models for Controllable Bidirectional Satellite-Map Translation
Zhe Dong
Yuzhe Sun
Tianzhu Liu
Wangmeng Zuo
Yanfeng Gu
57
0
0
28 Apr 2025
RepText: Rendering Visual Text via Replicating
Haozhao Wang
Yongjun Xu
Yongqian Li
Jiajun Li
Chaowei Zhang
J. Wang
Kejia Yang
Z. Chen
VLM
66
0
0
28 Apr 2025
What's Pulling the Strings? Evaluating Integrity and Attribution in AI Training and Inference through Concept Shift
Jiamin Chang
Hao Li
Hammond Pearce
Ruoxi Sun
Bo-wen Li
Minhui Xue
38
0
0
28 Apr 2025
Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting
Y. Hong
Xiaofei Yin
Xinzhong Wang
Yi Tu
Ya Guo
Sufeng Duan
Weiqiang Wang
Lingyong Fang
Depeng Wang
Huijia Zhu
CLL
94
0
0
27 Apr 2025
Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning
Mingxuan Cui
Yihao Huang
Yixuan Wang
Hongkai Yu
Dahua Lin
Q. Zou
Ming-Ming Cheng
Xuzhao Li
3DGS
43
0
0
23 Apr 2025
Subject-driven Video Generation via Disentangled Identity and Motion
Daneul Kim
Jingxu Zhang
W. Jin
Sunghyun Cho
Qi Dai
Jaesik Park
Chong Luo
DiffM
VGen
115
0
0
23 Apr 2025
DreamO: A Unified Framework for Image Customization
Chong Mou
Yanze Wu
Wenxu Wu
Zinan Guo
Pengze Zhang
...
Shaojin Wu
Songtao Zhao
Jian Zhang
Qian He
Xinglong Wu
49
0
0
23 Apr 2025
FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation
Zebin Yao
Lei Ren
Huixing Jiang
Chen Wei
Xiaojie Wang
Ruifan Li
Fangxiang Feng
DiffM
76
0
0
22 Apr 2025
Solving New Tasks by Adapting Internet Video Knowledge
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
30
1
0
21 Apr 2025
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
47
0
0
19 Apr 2025
LLM-Enabled Style and Content Regularization for Personalized Text-to-Image Generation
Anran Yu
Wei Feng
Yuhang Zhang
Xiang Li
Lei Meng
Lei Wu
X. Meng
DiffM
24
0
0
19 Apr 2025
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Zichuan Liu
Liming Jiang
Qing Yan
Yumin Jia
Hao Kang
Xin Lu
DiffM
31
0
0
19 Apr 2025
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework
Jiale Tao
Yanbing Zhang
Qixun Wang
Yiji Cheng
Haofan Wang
...
Ruihuang Li
Linqing Wang
Chunyu Wang
Qin Lin
Qinglin Lu
DiffM
47
1
0
16 Apr 2025
RDI: An adversarial robustness evaluation metric for deep neural networks based on sample clustering features
Jialei Song
Xingquan Zuo
Feiyang Wang
Hai Huang
Tianle Zhang
AAML
85
0
0
16 Apr 2025
Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis
Songping Wang
Yueming Lyu
Shiqi Liu
Ning Li
Tong Tong
Hao Sun
Caifeng Shan
PICV
67
0
0
16 Apr 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
Kevin Xie
Amirmojtaba Sabour
Jiahui Huang
Despoina Paschalidou
G. Klár
Umar Iqbal
Sanja Fidler
Fangyin Wei
VGen
MDE
39
0
0
15 Apr 2025
Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers
Chunyang Zhang
Zhenhong Sun
Zhicheng Zhang
Junyan Wang
Yu Zhang
Dong Gong
H. Mo
Daoyi Dong
45
0
0
14 Apr 2025
Flux Already Knows -- Activating Subject-Driven Image Generation without Training
Hao Kang
Stathi Fotiadis
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Min Jin Chong
Xin Lu
40
0
0
12 Apr 2025
COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails
Miguel Espinosa
V. Marsocci
Yuru Jia
Elliot J. Crowley
Mikolaj Czerkawski
DiffM
52
0
0
11 Apr 2025
ID-Booth: Identity-consistent Face Generation with Diffusion Models
Darian Tomašević
Fadi Boutros
Chenhao Lin
Naser Damer
Vitomir Štruc
Peter Peer
DiffM
60
1
0
10 Apr 2025
Probability Density Geodesics in Image Diffusion Latent Space
Qingtao Yu
Jaskirat Singh
Zhaoyuan Yang
Peter Tu
Jing Zhang
Hongdong Li
Richard Hartley
Dylan Campbell
DiffM
65
0
0
09 Apr 2025
TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment
Adnan Khan
Alireza Choubineh
Mai A. Shaaban
Abbas Akkasi
Majid Komeili
DiffM
40
0
0
07 Apr 2025
1
2
3
4
...
10
11
12
Next