Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.03396
Cited By
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
6 January 2023
Michal Stypulkowski
Konstantinos Vougioukas
Sen He
Maciej Ziȩba
Stavros Petridis
M. Pantic
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation"
27 / 27 papers shown
Title
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
52
0
0
01 May 2025
MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance
Mengting Wei
Yante Li
Tuomas Varanka
Yan Jiang
Guoying Zhao
DiffM
VGen
74
0
0
30 Apr 2025
FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis
Ziqi Ni
Ao Fu
Yi Zhou
61
0
0
06 Mar 2025
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Antoni Bigata
Michał Stypułkowski
Rodrigo Mira
Stella Bounareli
Konstantinos Vougioukas
Zoe Landgraf
Nikita Drobyshev
Maciej Ziȩba
Stavros Petridis
M. Pantic
DiffM
VGen
65
2
0
03 Mar 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Gaojie Lin
Jianwen Jiang
Jiaqi Yang
Zerong Zheng
Chao Liang
DiffM
VGen
183
11
0
03 Feb 2025
Quantum Diffusion Model for Quark and Gluon Jet Generation
Mariia Baidachna
Rey Guadarrama
Gopal Ramesh Dahale
Tom Magorsch
Isabel Pedraza
Konstantin T. Matchev
Katia Matcheva
Kyoungchul Kong
S. Gleyzer
DiffM
47
0
0
31 Dec 2024
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Jiehui Huang
Xiao Dong
Wenhui Song
Zheng Chong
Zhiqiang Zhang
...
Long Chen
Hanhui Li
Yiqiang Yan
Shengcai Liao
Xiaodan Liang
DiffM
50
19
0
31 Dec 2024
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
Hanbo Cheng
Limin Lin
Chenyu Liu
Pengcheng Xia
Pengfei Hu
Jiefeng Ma
Jun Du
Jia Pan
DiffM
VGen
136
0
0
17 Oct 2024
LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details
Jian Yang
Xukun Wang
Wentao Wang
Guoming Li
Qihang Fang
Ruihong Yuan
Tianyang Wang
Jason Zhaoxin Fan
Yeying Jin
Zhaoxin Fan
VGen
47
1
0
01 Oct 2024
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE
Sichun Wu
Kazi Injamamul Haque
Zerrin Yumak
VGen
30
2
0
12 Sep 2024
EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion
Jian Zhang
Weijian Mai
Zhijun Zhang
VGen
32
0
0
11 Sep 2024
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
Gaojie Lin
Jianwen Jiang
Chao Liang
Tianyun Zhong
Jiaqi Yang
Yanbo Zheng
VGen
DiffM
63
13
0
03 Sep 2024
High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
Weizhi Zhong
Junfan Lin
Peixin Chen
Liang Lin
Guanbin Li
36
1
0
10 Aug 2024
Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation
Jintao Tan
Xize Cheng
Lingyu Xiong
Lei Zhu
Xiandong Li
Wenxiong Kang
Kai Gong
Minglei Li
Yi Cai
DiffM
28
2
0
03 Aug 2024
Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data
Mahdi Morafah
M. Reisser
Bill Lin
Christos Louizos
FedML
34
5
0
13 May 2024
LatentColorization: Latent Diffusion-Based Speaker Video Colorization
Rory Ward
Dan Bigioi
Shubhajit Basak
John G. Breslin
Peter Corcoran
VGen
DiffM
27
2
0
09 May 2024
Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models
Zeyu Yang
Peikun Guo
Khadija Zanna
Akane Sano
Xiaoxue Yang
Akane Sano
DiffM
34
8
0
12 Apr 2024
DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
DiffM
43
7
0
25 Mar 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Wang
Xin Li
Luisa Verdoliva
Shu Hu
86
57
0
22 Jan 2024
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
42
1
0
12 Dec 2023
Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism
Georgios Milis
P. Filntisis
A. Roussos
Petros Maragos
CVBM
34
2
0
11 Dec 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
Louis Airale
Dominique Vaufreydaz
Xavier Alameda-Pineda
23
1
0
04 Jul 2023
Face Generation and Editing with StyleGAN: A Survey
Andrew Melnik
Maksim Miasayedzenkau
Dzianis Makaravets
Dzianis Pirshtuk
Eren Akbulut
Dennis Holzmann
Tarek Renusch
Gustav Reichert
Helge J. Ritter
CVBM
27
40
0
18 Dec 2022
Autoregressive GAN for Semantic Unconditional Head Motion Generation
Louis Airale
Xavier Alameda-Pineda
Stéphane Lathuilière
Dominique Vaufreydaz
22
3
0
02 Nov 2022
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model
Xinya Ji
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Wayne Wu
Feng Xu
Xun Cao
CVBM
54
157
0
30 May 2022
Flexible Diffusion Modeling of Long Videos
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian Weilbach
Frank D. Wood
DiffM
BDL
VGen
176
285
0
23 May 2022
PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
Yurui Ren
Gezhong Li
Yuanqi Chen
Thomas H. Li
Shan Liu
DiffM
VGen
49
224
0
17 Sep 2021
1