2503.07091
FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset
10 March 2025
Shuhe Wang
Xiaoya Li
Jiwei Li
G. Wang
Xiaofei Sun
Bob Zhu
Han Qiu
Mo Yu
Shengjie Shen
Tianwei Zhang
Eduard H. Hovy
VLM
Papers citing "FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset"
36 / 36 papers shown
Title
Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes
Kaiqing Lin
Zhiyuan Yan
Ke-Yue Zhang
Li Hao
Yue Zhou
Yuzhen Lin
Weixiang Li
Taiping Yao
Shouhong Ding
Bin Li
111
0
0
26 May 2025
GPT-4o System Card
OpenAI
Aaron Hurst
Adam Lerer
Adam P. Goucher
...
Yuchen He
Yuchen Zhang
Yujia Jin
Yunxing Dai
Yury Malkov
MLLM
195
984
0
25 Oct 2024
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Zinan Guo
Yanze Wu
Zhuowei Chen
Lang Chen
Qian He
DiffM
84
65
0
24 Apr 2024
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Rinon Gal
Or Lichter
Elad Richardson
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
DiffM
93
33
0
04 Apr 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
284
1,364
0
05 Mar 2024
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Xu Peng
Junwei Zhu
Boyuan Jiang
Ying Tai
Donghao Luo
Jiangning Zhang
Wei Lin
Taisong Jin
Chengjie Wang
Rongrong Ji
DiffM
117
59
0
11 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
96
197
0
07 Dec 2023
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
DiffM
95
802
0
13 Aug 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
222
2,413
0
04 Jul 2023
Face0: Instantaneously Conditioning a Text-to-Image Model on a Face
Dani Valevski
Danny Lumen
Yossi Matias
Yaniv Leviathan
DiffM
VLM
53
77
0
11 Jun 2023
Inserting Anybody in Diffusion Models via Celeb Basis
Genlan Yuan
Xiaodong Cun
Yong Zhang
Maomao Li
Chenyang Qi
Xintao Wang
Ying Shan
Huicheng Zheng
DiffM
67
53
0
01 Jun 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGen
DiffM
114
254
0
17 May 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions
Lianghua Huang
Di Chen
Yu Liu
Yujun Shen
Deli Zhao
Jingren Zhou
DiffM
56
288
0
20 Feb 2023
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
95
2,386
0
19 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
149
872
0
08 Dec 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Ming-Yu Liu
VLM
MoE
177
827
0
02 Nov 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
194
3,482
0
16 Oct 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
279
2,885
0
25 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
164
1,889
0
02 Aug 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
437
6,062
0
23 May 2022
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
Ming Ding
Wendi Zheng
Wenyi Hong
Jie Tang
VLM
106
334
0
28 Apr 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
410
6,897
0
13 Apr 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
460
15,665
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
364
3,605
0
20 Dec 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
241
7,933
0
11 May 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
967
29,731
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
418
4,987
0
24 Feb 2021
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
344
6,551
0
26 Nov 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
286
7,454
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
658
18,276
0
19 Jun 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
445
20,298
0
23 Oct 2019
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
228
5,061
0
02 Nov 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
725
132,199
0
12 Jun 2017
PixelVAE: A Latent Variable Model for Natural Images
Ishaan Gulrajani
Kundan Kumar
Faruk Ahmed
Adrien Ali Taïga
Francesco Visin
David Vazquez
Aaron Courville
DRL
SSL
BDL
80
340
0
15 Nov 2016
Conditional Image Generation with PixelCNN Decoders
Aaron van den Oord
Nal Kalchbrenner
Oriol Vinyals
L. Espeholt
Alex Graves
Koray Kavukcuoglu
VLM
209
2,517
0
16 Jun 2016
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
416
43,777
0
01 May 2014