Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.12963
Cited By
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
17 March 2025
Chaolong Yang
Kai Yao
Yuyao Yan
Chenru Jiang
Weiguang Zhao
Jie Sun
Guangliang Cheng
Yifei Zhang
Bin Dong
K. Huang
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait"
16 / 16 papers shown
Title
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
Jianzhu Guo
Dingyun Zhang
Xiaoqiang Liu
Zhizhou Zhong
Yuan Zhang
Pengfei Wan
Di Zhang
VGen
88
60
0
03 Jul 2024
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
K. Cheng
Xiaodong Cun
Yong Zhang
Menghan Xia
Fei Yin
Mingrui Zhu
Xuanxia Wang
Jue Wang
Nan Wang
CVBM
50
99
0
27 Nov 2022
SPACE: Speech-driven Portrait Animation with Controllable Expression
Francesco Ferroni
Arun Mallya
Ting-Chun Wang
Rafael Valle
Xuan Li
VGen
59
47
0
17 Nov 2022
Implicit Warping for Animation with Image Sets
Arun Mallya
Ting-Chun Wang
Xuan Li
VGen
142
41
0
04 Oct 2022
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
Fei Yin
Yong Zhang
Xiaodong Cun
Ming Cao
Yanbo Fan
Xuanxia Wang
Qingyan Bai
Baoyuan Wu
Jue Wang
Yujiu Yang
CVBM
87
173
0
08 Mar 2022
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
255
30,123
0
01 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
388
15,454
0
20 Dec 2021
PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
Yurui Ren
Gezhong Li
Yuanqi Chen
Thomas H. Li
Shan Liu
DiffM
VGen
98
227
0
17 Sep 2021
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
Ting-Chun Wang
Arun Mallya
Xuan Li
3DH
92
482
0
30 Nov 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
231
7,350
0
06 Oct 2020
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
EGVM
96
777
0
23 Aug 2020
MakeItTalk: Speaker-Aware Talking-Head Animation
Yang Zhou
Xintong Han
Eli Shechtman
J. Echevarria
E. Kalogerakis
Dingzeyu Li
63
421
0
27 Apr 2020
Towards Automatic Face-to-Face Translation
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
89
174
0
01 Mar 2020
First Order Motion Model for Image Animation
Aliaksandr Siarohin
Stéphane Lathuilière
Sergey Tulyakov
Elisa Ricci
N. Sebe
VGen
DiffM
81
925
0
29 Feb 2020
Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise Loss
Lele Chen
R. Maddox
Z. Duan
Chenliang Xu
CVBM
68
398
0
09 May 2019
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.7K
150,006
0
22 Dec 2014
1