Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.16022
Cited By
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
24 April 2024
Zinan Guo
Yanze Wu
Zhuowei Chen
Lang Chen
Qian He
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PuLID: Pure and Lightning ID Customization via Contrastive Alignment"
37 / 37 papers shown
Title
FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images
Raul Ismayilov
Dzemila Sero
Luuk Spreeuwers
96
0
0
12 May 2025
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning
Fulong Ye
Miao Hua
Pengze Zhang
Xinghui Li
Qichao Sun
Mingcong Liu
Qian He
Xinglong Wu
124
0
0
20 Apr 2025
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Zichuan Liu
Liming Jiang
Qing Yan
Yumin Jia
Hao Kang
Xin Lu
DiffM
88
0
0
19 Apr 2025
Teaching Humans Subtle Differences with DIFFusion
Mia Chiquier
Orr Avrech
Yossi Gandelsman
Berthy Feng
Katherine Bouman
Carl Vondrick
DiffM
107
0
0
10 Apr 2025
Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization
Barış Batuhan Topal
Umut Özyurt
Zafer Doğan Budak
Ramazan Gokberk Cinbis
78
0
0
28 Mar 2025
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
Yong Zhong
Zhuoyi Yang
Jiayan Teng
Xiaotao Gu
Chongxuan Li
VGen
103
4
0
18 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-Jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Yue Yang
175
2
0
16 Mar 2025
FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset
Shuhe Wang
Xiaoya Li
Jiwei Li
G. Wang
Xiaofei Sun
...
Han Qiu
Mo Yu
Shengjie Shen
Tianwei Zhang
Eduard H. Hovy
VLM
92
1
0
10 Mar 2025
Phantom: Subject-consistent video generation via cross-modal alignment
Lijie Liu
Tianxiang Ma
Bingchuan Li
Zhuowei Chen
Jiawei Liu
Qian He
Xinglong Wu
Qian He
Xinglong Wu
DiffM
VGen
139
10
0
16 Feb 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
111
6
0
03 Jan 2025
Nested Attention: Semantic-aware Attention Values for Concept Personalization
Or Patashnik
Rinon Gal
Daniil Ostashev
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
DiffM
80
6
0
03 Jan 2025
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Guocheng Qian
Kuan-Chieh Wang
Or Patashnik
Negin Heravi
Daniil Ostashev
Sergey Tulyakov
Daniel Cohen-Or
Kfir Aberman
119
4
0
12 Dec 2024
SerialGen: Personalized Image Generation by First Standardization Then Personalization
Cong Xie
Han Zou
Ruiqi Yu
Yan Zhang
Zhenpeng Zhan
114
1
0
02 Dec 2024
EmojiDiff: Advanced Facial Expression Control with High Identity Preservation in Portrait Generation
Liangwei Jiang
Ruida Li
Zhifeng Zhang
Shuo Fang
Chenguang Ma
DiffM
137
1
0
02 Dec 2024
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Yuxi Ren
Xin Xia
Yanzuo Lu
Jiacheng Zhang
Jie Wu
Pan Xie
Xing Wang
Xuefeng Xiao
98
74
0
21 Apr 2024
AnimateDiff-Lightning: Cross-Model Diffusion Distillation
Shanchuan Lin
Xiao Yang
DiffM
VGen
71
23
0
19 Mar 2024
InstantID: Zero-shot Identity-Preserving Generation in Seconds
Qixun Wang
Xu Bai
Haofan Wang
Zekui Qin
Anthony Chen
Huaxia Li
Xu Tang
Feng-Long Xie
79
249
0
15 Jan 2024
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
80
197
0
07 Dec 2023
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Mihir Prabhudesai
Anirudh Goyal
Deepak Pathak
Katerina Fragkiadaki
90
128
0
05 Oct 2023
Inserting Anybody in Diffusion Models via Celeb Basis
Genlan Yuan
Xiaodong Cun
Yong Zhang
Maomao Li
Chenyang Qi
Xintao Wang
Ying Shan
Huicheng Zheng
DiffM
49
53
0
01 Jun 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
113
371
0
12 Apr 2023
EVA-CLIP: Improved Training Techniques for CLIP at Scale
Quan-Sen Sun
Yuxin Fang
Ledell Yu Wu
Xinlong Wang
Yue Cao
CLIP
VLM
129
493
0
27 Mar 2023
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou
Xintao Wang
Liangbin Xie
Yanze Wu
Shuai Liu
Zhongang Qi
Ying Shan
Xiaohu Qie
DiffM
67
1,014
0
16 Feb 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
112
4,104
1
10 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
409
4,539
0
30 Jan 2023
DiffFace: Diffusion-based Face Swapping with Facial Guidance
Kihong Kim
Yunho Kim
Seokju Cho
Junyoung Seo
Jisu Nam
Kychul Lee
Seung Wook Kim
Kwanghee Lee
DiffM
52
55
0
27 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
122
870
0
08 Dec 2022
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models
Cheng Lu
Yuhao Zhou
Fan Bao
Jianfei Chen
Chongxuan Li
Jun Zhu
DiffM
135
590
0
02 Nov 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
117
1,873
0
02 Aug 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
167
1,967
0
01 Jun 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
360
6,854
0
13 Apr 2022
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
383
10,273
0
17 Jun 2021
Generalizing Face Forgery Detection with High-frequency Features
Yucheng Luo
Yong Zhang
Junchi Yan
Wei Liu
CVBM
62
339
0
23 Mar 2021
Towards Real-World Blind Face Restoration with Generative Facial Prior
Xintao Wang
Yu Li
Honglun Zhang
Ying Shan
CVBM
74
446
0
11 Jan 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
228
7,350
0
06 Oct 2020
Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
Elad Richardson
Yuval Alaluf
Or Patashnik
Yotam Nitzan
Yaniv Azar
Stav Shapiro
Daniel Cohen-Or
122
1,108
0
03 Aug 2020
CurricularFace: Adaptive Curriculum Learning Loss for Deep Face Recognition
Yanhua Huang
Yuhan Wang
Ying Tai
Xiaoming Liu
Pengcheng Shen
Shaoxin Li
Jilin Li
Feiyue Huang
CVBM
53
505
0
01 Apr 2020
1