Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.16272
Cited By
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
26 December 2023
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
Hao Tang
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"
50 / 52 papers shown
Title
BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation
Panwen Hu
Jiehui Huang
Qiang Sun
Xiaodan Liang
DiffM
VGen
28
0
0
11 May 2025
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Zinan Guo
Pengze Zhang
Yanze Wu
Chong Mou
Songtao Zhao
Qian He
30
0
0
05 May 2025
Subject-driven Video Generation via Disentangled Identity and Motion
Daneul Kim
Jingxu Zhang
W. Jin
Sunghyun Cho
Qi Dai
Jaesik Park
Chong Luo
DiffM
VGen
110
0
0
23 Apr 2025
Flux Already Knows -- Activating Subject-Driven Image Generation without Training
Hao Kang
Stathi Fotiadis
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Min Jin Chong
Xin Lu
37
0
0
12 Apr 2025
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
Junjia Huang
Pengxiang Yan
Jiyang Liu
Jie Wu
Zhao Wang
Yitong Wang
Liang Lin
G. Li
35
0
0
11 Apr 2025
FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation
Linyan Huang
Haonan Lin
Yanning Zhou
Kaiwen Xiao
42
0
0
10 Apr 2025
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation
Shaojin Wu
Mengqi Huang
Wenxu Wu
Yufeng Cheng
Fei Ding
Qian He
DiffM
52
4
0
02 Apr 2025
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy
Xingchao Yang
Takafumi Taketomi
Yuki Endo
Yoshihiro Kanamori
DiffM
46
0
0
21 Mar 2025
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Yaowei Li
Lingen Li
Zhaoyang Zhang
Xiaoyu Li
Guangzhi Wang
Hongxiang Li
Xiaodong Cun
Ying Shan
Yuexian Zou
DiffM
67
1
0
17 Mar 2025
Personalize Anything for Free with Diffusion Transformer
Haoran Feng
Zehuan Huang
Lin Li
Hairong Lv
Lu Sheng
DiffM
78
1
0
16 Mar 2025
MoEdit: On Learning Quantity Perception for Multi-object Image Editing
Yanfeng Li
Kahou Chan
Yue Sun
C. Lam
Tong Tong
Zitong Yu
Keren Fu
Xiaohong Liu
Tao Tan
DiffM
38
0
0
13 Mar 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
Yijing Lin
Mengqi Huang
Shuhan Zhuang
Zhendong Mao
VGen
43
0
0
13 Mar 2025
Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion
Kaifeng Zou
Xiaoyi Feng
Peng Wang
Tao Huang
Zizhou Huang
Zhang Haihang
Yuntao Zou
Dagang Li
DiffM
49
0
0
12 Mar 2025
OminiControl2: Efficient Conditioning for Diffusion Transformers
Zhenxiong Tan
Qiaochu Xue
Xingyi Yang
Songhua Liu
Xinchao Wang
DiffM
42
0
0
11 Mar 2025
LatexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending
Jian Jin
Zhenbo Yu
Yang Shen
Zhenyong Fu
Jian Yang
DiffM
63
0
0
10 Mar 2025
Personalized Generation In Large Model Era: A Survey
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
W. Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
79
2
0
04 Mar 2025
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers
D. She
Mushui Liu
Jingxuan Pang
Jin Wang
Zhen Yang
...
Yi Wang
Qihan Huang
Haobin Tang
Yunlong Yu
Siming Fu
VGen
96
4
0
21 Feb 2025
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning
Yuzhou Huang
Ziyang Yuan
Quande Liu
Qiulin Wang
Xintao Wang
Ruimao Zhang
Pengfei Wan
Di Zhang
Kun Gai
VGen
DiffM
37
10
0
08 Jan 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
72
6
0
03 Jan 2025
OmniPrism: Learning Disentangled Visual Concept for Image Generation
Yangyang Li
Daqing Liu
Wu Liu
Allen He
Xinchen Liu
Yongdong Zhang
Guoqing Jin
DiffM
CoGe
78
0
0
16 Dec 2024
IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation
Yiren Song
Pei Yang
Hai Ci
Mike Zheng Shou
117
3
0
16 Dec 2024
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Ziwei Huang
Wanggui He
Quanyu Long
Yandi Wang
Haoyuan Li
...
Fangxun Shu
Long Chen
Hao Jiang
Leilei Gan
Fei Wu
EGVM
184
3
0
05 Dec 2024
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Xinghui Li
Qichao Sun
Pengze Zhang
Fulong Ye
Zhichao Liao
Wanquan Feng
Songtao Zhao
Qian He
DiffM
70
2
0
05 Dec 2024
DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
Yuxuan Duan
Y. Hong
Bo Zhang
Jun Lan
Huijia Zhu
Weiqiang Wang
Jianfu Zhang
Li Niu
L. Zhang
DiffM
46
0
0
07 Nov 2024
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
Dewei Zhou
Ji Xie
Zongxin Yang
Yi Yang
DiffM
64
7
0
16 Oct 2024
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Yepeng Liu
Yiren Song
Hai Ci
Yu Zhang
Haofan Wang
Mike Zheng Shou
Yuheng Bu
WIGM
54
3
0
07 Oct 2024
Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection
Yuhang Ma
Wenting Xu
Chaoyi Zhao
Keqiang Sun
Qinfeng Jin
Zeng Zhao
Changjie Fan
Zhipeng Hu
VGen
DiffM
27
1
0
29 Sep 2024
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Qihan Huang
Siming Fu
Jinlong Liu
Hao Jiang
Yipeng Yu
Jie Song
31
5
0
26 Sep 2024
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
Zhengguang Zhou
Jing Li
Huaxia Li
Nemo Chen
Xu Tang
DiffM
VGen
34
8
0
19 Sep 2024
GroundingBooth: Grounding Text-to-Image Customization
Zhexiao Xiong
Wei Xiong
Jing Shi
He Zhang
Yizhi Song
Nathan Jacobs
DiffM
56
6
0
13 Sep 2024
CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization
Nan Chen
Mengqi Huang
Zhuowei Chen
Yang Zheng
Lei Zhang
Zhendong Mao
DiffM
46
5
0
09 Sep 2024
Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models
Chaohua Shi
Xuan Wang
Si Shi
Xule Wang
Mingrui Zhu
Nannan Wang
X. Gao
CoGe
28
1
0
26 Aug 2024
Stable-Hair: Real-World Hair Transfer via Diffusion Model
Yuxuan Zhang
Qing Zhang
Yiren Song
Jiaming Liu
DiffM
46
6
0
19 Jul 2024
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
Yanting Miao
William Loh
Suraj Kothawade
Pascal Poupart
Abdullah Rashwan
Yeqing Li
EGVM
47
1
0
16 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Y. Li
Fan Ma
Zongxin Yang
Y. Yang
95
11
0
02 Jul 2024
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
Yuhang Ma
Wenting Xu
Jiji Tang
Qinfeng Jin
Rongsheng Zhang
Zeng Zhao
Changjie Fan
Zhipeng Hu
38
6
0
24 Jun 2024
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts
Yucheng Han
Rui Wang
Chi Zhang
Juntao Hu
Pei Cheng
Bin-Bin Fu
Hanwang Zhang
70
6
0
13 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
38
41
0
11 Jun 2024
AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization
Junjie Shentu
Matthew Watson
Noura Al Moubayed
DiffM
49
0
0
28 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
34
9
0
23 May 2024
FreeTuner: Any Subject in Any Style with Training-free Diffusion
Youcan Xu
Zhen Wang
Jun Xiao
Wei Liu
Long Chen
DiffM
36
9
0
23 May 2024
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation
Yuxiang Wei
Zhilong Ji
Jinfeng Bai
Hongzhi Zhang
Lei Zhang
W. Zuo
DiffM
46
0
0
09 May 2024
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Zehuan Huang
Hongxing Fan
Lipeng Wang
Lu Sheng
DiffM
34
10
0
23 Apr 2024
UVMap-ID: A Controllable and Personalized UV Map Generative Model
Weijie Wang
Jichao Zhang
Chang Liu
Xia Li
Xingqian Xu
Humphrey Shi
N. Sebe
Bruno Lepri
33
2
0
22 Apr 2024
StableGarment: Garment-Centric Generation via Stable Diffusion
Rui Wang
Hailong Guo
Jiaming Liu
Huaxia Li
Haibo Zhao
Xu Tang
Yao Hu
Hao Tang
Peipei Li
DiffM
32
13
0
16 Mar 2024
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Yuxuan Zhang
Lifu Wei
Qing Zhang
Yiren Song
DiffM
31
12
0
12 Mar 2024
λ
λ
λ
-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Maitreya Patel
Sangmin Jung
Chitta Baral
Yezhou Yang
VLM
31
28
0
07 Feb 2024
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
32
1
0
09 Oct 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe-nan Lin
H. J. Jung
DiffM
119
278
0
06 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
267
4,229
0
30 Jan 2023
1
2
Next