Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.07519
Cited By
InstantID: Zero-shot Identity-Preserving Generation in Seconds
15 January 2024
Qixun Wang
Xu Bai
Haofan Wang
Zekui Qin
Anthony Chen
Huaxia Li
Xu Tang
Yao Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InstantID: Zero-shot Identity-Preserving Generation in Seconds"
50 / 193 papers shown
Title
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability
Xirui Hu
Jiahao Wang
Hao Chen
Weizhan Zhang
Benqi Wang
Yangfu Li
Haishun Nan
DiffM
67
0
0
09 Mar 2025
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao
Weijia Mao
Mike Zheng Shou
66
0
0
05 Mar 2025
VisAgent: Narrative-Preserving Story Visualization Framework
Seungkwon Kim
GyuTae Park
Sangyeon Kim
Seung-Hun Nam
40
0
0
04 Mar 2025
Personalized Generation In Large Model Era: A Survey
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Luu Anh Tuan
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
79
2
0
04 Mar 2025
ACCORD: Alleviating Concept Coupling through Dependence Regularization for Text-to-Image Diffusion Personalization
Shizhan Liu
Hao Zheng
Hang Yu
Jianguo Li
DiffM
71
0
0
03 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
39
0
0
02 Mar 2025
UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition
Xiao Lin
Y. Huang
Jianqing Xu
Yuxi Mi
Shuigeng Zhou
Shouhong Ding
72
0
0
27 Feb 2025
SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization
Shubhankar Borse
K. Bhardwaj
Mohammad Reza Karimi Dastjerdi
Hyojin Park
Shreya Kadambi
...
Prathamesh Mandke
Ankita Nayak
Harris Teague
Munawar Hayat
Fatih Porikli
DiffM
84
1
0
27 Feb 2025
InstaFace: Identity-Preserving Facial Editing with Single Image Inference
MD Wahiduzzaman Khan
Mingshan Jia
Shaolin Zhang
En Yu
Caifeng Shan
Kaska Musial-Gabrys
DiffM
54
0
0
27 Feb 2025
DP-Adapter: Dual-Pathway Adapter for Boosting Fidelity and Text Consistency in Customizable Human Image Generation
Ye Wang
Xuping Xie
Lanjun Wang
Zili Yi
Rui Ma
DiffM
97
0
0
21 Feb 2025
Phantom: Subject-consistent video generation via cross-modal alignment
Lijie Liu
Tianxiang Ma
Bingchuan Li
Zhuowei Chen
Jiawei Liu
Qian He
Xinglong Wu
Qian He
Xinglong Wu
DiffM
VGen
52
5
0
16 Feb 2025
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
Zhenxing Mi
Kuan-Chieh Jackson Wang
Guocheng Qian
Hanrong Ye
Runtao Liu
Sergey Tulyakov
Kfir Aberman
Dan Xu
LRM
47
0
0
12 Feb 2025
ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions
Shiyue Zhang
Zheng Chong
Xi Lu
Wenqing Zhang
Haoxiang Li
Xujie Zhang
Jiehui Huang
Xiao Dong
Xiaodan Liang
DiffM
42
0
0
21 Jan 2025
Multi-subject Open-set Personalization in Video Generation
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Yuwei Fang
Kwot Sin Lee
Ivan Skorokhodov
Kfir Aberman
Jun-Yan Zhu
Ming Yang
Sergey Tulyakov
DiffM
VGen
69
7
0
10 Jan 2025
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning
Yuzhou Huang
Ziyang Yuan
Quande Liu
Qiulin Wang
Xintao Wang
Ruimao Zhang
Pengfei Wan
Di Zhang
Kun Gai
VGen
DiffM
45
10
0
08 Jan 2025
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Chaojie Mao
J. Zhang
Yulin Pan
Zeyinzi Jiang
Zhen Han
Yu Liu
Jingren Zhou
DiffM
48
15
0
05 Jan 2025
SlerpFace: Face Template Protection via Spherical Linear Interpolation
Zhizhou Zhong
Yuxi Mi
Y. Huang
Jianqing Xu
Guodong Mu
Shouhong Ding
Jingyun Zhang
Rizen Guo
Yunsheng Wu
Shuigeng Zhou
AAML
PICV
85
5
0
03 Jan 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
78
6
0
03 Jan 2025
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Jiehui Huang
Xiao Dong
Wenhui Song
Zheng Chong
Zhiqiang Zhang
...
Long Chen
Hanhui Li
Yiqiang Yan
Shengcai Liao
Xiaodan Liang
DiffM
50
19
0
31 Dec 2024
Personalized Large Vision-Language Models
Chau Pham
Hoang Phan
David Doermann
Yunjie Tian
VLM
49
3
0
23 Dec 2024
RealisID: Scale-Robust and Fine-Controllable Identity Customization via Local and Global Complementation
Zhaoyang Sun
Fei Du
Weihua Chen
Fan Wang
Yaxiong Chen
Yi Rong
Shengwu Xiong
DiffM
83
1
0
22 Dec 2024
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration
Lu Liu
Huiyu Duan
Qiang Hu
Liu Yang
Chunlei Cai
Tianxiao Ye
Huayu Liu
Xiaoyun Zhang
Guangtao Zhai
EGVM
97
1
0
17 Dec 2024
IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation
Yiren Song
Pei Yang
Hai Ci
Mike Zheng Shou
125
3
0
16 Dec 2024
ColorFlow: Retrieval-Augmented Image Sequence Colorization
Junhao Zhuang
Xuan Ju
Zhe Zhang
Yong-Jin Liu
Shiyi Zhang
Chun Yuan
Ying Shan
DiffM
107
1
0
16 Dec 2024
SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
Zhaoyang Sun
Shengwu Xiong
Yaxiong Chen
Fei Du
Weihua Chen
Fan Wang
Yi Rong
DiffM
74
1
0
15 Dec 2024
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
Yue Chen
Xingyu Chen
Anpei Chen
Gerard Pons-Moll
Yuliang Xiu
3DGS
86
3
0
12 Dec 2024
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Guocheng Qian
Kuan-Chieh Jackson Wang
Or Patashnik
Negin Heravi
Daniil Ostashev
Sergey Tulyakov
Daniel Cohen-Or
Kfir Aberman
93
4
0
12 Dec 2024
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen
Zhifei Zhang
He Zhang
Yuqian Zhou
Seunggeun Kim
...
Nanxuan Zhao
Yilin Wang
Hui Ding
Zhe Lin
Hengshuang Zhao
VGen
DiffM
126
21
0
10 Dec 2024
Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting
Guangben Lu
Yuzhen Du
Zhimin Sun
Ran Yi
Yifan Qi
Yizhe Tang
Tianyi Wang
Lizhuang Ma
Fangyuan Zou
DiffM
80
1
0
05 Dec 2024
EmojiDiff: Advanced Facial Expression Control with High Identity Preservation in Portrait Generation
Liangwei Jiang
Ruida Li
Zhifeng Zhang
Shuo Fang
Chenguang Ma
DiffM
74
1
0
02 Dec 2024
SerialGen: Personalized Image Generation by First Standardization Then Personalization
Cong Xie
Han Zou
Ruiqi Yu
Yan Zhang
Zhenpeng Zhan
72
1
0
02 Dec 2024
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Florinel-Alin Croitoru
Andrei Iulian Hiji
Vlad Hondru
Nicolae-Cătălin Ristea
Paul Irofti
Marius Popescu
Cristian Rusu
Radu Tudor Ionescu
Fahad Shahbaz Khan
Mubarak Shah
89
3
0
29 Nov 2024
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
Shwetha Ram
T. Neiman
Qianli Feng
Andrew Stuart
S. D. Tran
Trishul Chilimbi
77
1
0
28 Nov 2024
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
Shengqu Cai
Eric Ryan Chan
Yunzhi Zhang
Leonidas J. Guibas
Jiajun Wu
Gordon Wetzstein
83
8
0
27 Nov 2024
HiFiVFS: High Fidelity Video Face Swapping
Xu Chen
Keke He
Junwei Zhu
Yanhao Ge
Wei Li
Chengjie Wang
VGen
DiffM
80
1
0
27 Nov 2024
MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation
Haopeng Fang
Di Qiu
Binjie Mao
Pengfei Yan
He Tang
VGen
DiffM
72
4
0
27 Nov 2024
StableAnimator: High-Quality Identity-Preserving Human Image Animation
Shuyuan Tu
Zhen Xing
Xintong Han
Zhi-Qi Cheng
Qi Dai
Chong Luo
Zuxuan Wu
VGen
109
15
0
26 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
74
5
0
25 Nov 2024
AnyText2: Visual Text Generation and Editing With Customizable Attributes
Yuxiang Tuo
Yifeng Geng
Liefeng Bo
VLM
93
6
0
22 Nov 2024
Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation Knowledge
Yiyang Cai
Zhengkai Jiang
Yong-Jin Liu
Chunyang Jiang
Wei Xue
Wenhan Luo
Yike Guo
95
0
0
22 Nov 2024
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
Nvidia
:
Yuval Atzmon
Maciej Bala
Yogesh Balaji
...
Ting-Chun Wang
Shuran Song
Fangyin Wei
Yu Zeng
Qinsheng Zhang
58
6
0
11 Nov 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Z. Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
52
8
0
10 Nov 2024
Generalizable Single-Source Cross-modality Medical Image Segmentation via Invariant Causal Mechanisms
Boqi Chen
Yuanzhi Zhu
Yunke Ao
Sebastiano Caprara
Reto Sutter
Gunnar Rätsch
E. Konukoglu
A. Susmelj
MedIm
DiffM
OOD
51
0
0
07 Nov 2024
MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation
Han Yang
Sotiris Anagnostidis
Enis Simsar
Thomas Hofmann
DiffM
28
0
0
07 Nov 2024
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
Xiao Guo
Xiaohong Liu
I. Masi
Xiaoming Liu
95
9
0
31 Oct 2024
FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images
Zheng Yu
Yaohua Wang
Siying Cui
Aixi Zhang
Wei-Long Zheng
Senzhang Wang
36
0
0
30 Oct 2024
Unbounded: A Generative Infinite Game of Character Life Simulation
Jialu Li
Yuanzhen Li
Neal Wadhwa
Yael Pritch
David E. Jacobs
Michael Rubinstein
Joey Tianyi Zhou
Nataniel Ruiz
VGen
AI4CE
36
4
0
24 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
49
12
0
19 Oct 2024
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models
Donghao Zhou
Jiancheng Huang
J. Bai
Jiaze Wang
Hao Chen
Guangyong Chen
Xiaowei Hu
Pheng Ann Heng
47
5
0
17 Oct 2024
FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization
Cheng Yu
Haoyu Xie
Lei Shang
Yong-Jin Liu
Jun Dan
Liefeng Bo
Baigui Sun
24
2
0
16 Oct 2024
Previous
1
2
3
4
Next