ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.10137
  4. Cited By
Audio-driven Talking Face Video Generation with Learning-based
  Personalized Head Pose

Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose

24 February 2020
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong-jin Liu
    CVBM
ArXivPDFHTML

Papers citing "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose"

33 / 83 papers shown
Title
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Chao Feng
Ziyang Chen
Andrew Owens
31
71
0
04 Jan 2023
StyleTalk: One-shot Talking Head Generation with Controllable Speaking
  Styles
StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
Yifeng Ma
Suzhe Wang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Zhidong Deng
Xin Yu
56
82
0
03 Jan 2023
Imitator: Personalized Speech-driven 3D Facial Animation
Imitator: Personalized Speech-driven 3D Facial Animation
Balamurugan Thambiraja
I. Habibie
S. Aliakbarian
Darren Cosker
Christian Theobalt
Justus Thies
CVBM
39
49
0
30 Dec 2022
Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Anni Tang
Tianyu He
Xuejiao Tan
Jun Ling
Liang Song
CVBM
26
23
0
09 Dec 2022
Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in
  Transformers
Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
Yasheng Sun
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Zhibin Hong
Jingtuo Liu
Errui Ding
Jingdong Wang
Ziwei Liu
Koike Hideki
32
34
0
09 Dec 2022
Progressive Disentangled Representation Learning for Fine-Grained
  Controllable Talking Head Synthesis
Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis
Duomin Wang
Yu Deng
Zixin Yin
H. Shum
Baoyuan Wang
10
60
0
26 Nov 2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial
  Decomposition
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Jiaxiang Tang
Kaisiyuan Wang
Hang Zhou
Xiaokang Chen
Dongliang He
Tianshu Hu
Jingtuo Liu
Gang Zeng
Jingdong Wang
3DH
31
76
0
22 Nov 2022
Autoregressive GAN for Semantic Unconditional Head Motion Generation
Autoregressive GAN for Semantic Unconditional Head Motion Generation
Louis Airale
Xavier Alameda-Pineda
Stéphane Lathuilière
Dominique Vaufreydaz
17
3
0
02 Nov 2022
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via
  Audio-Lip Memory
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Se Jin Park
Minsu Kim
Joanna Hong
J. Choi
Y. Ro
CVBM
19
85
0
02 Nov 2022
Geometry Driven Progressive Warping for One-Shot Face Animation
Geometry Driven Progressive Warping for One-Shot Face Animation
Yatao Zhong
F. Amjadi
Ilya Zharkov
3DH
CVBM
21
1
0
05 Oct 2022
StableFace: Analyzing and Improving Motion Stability for Talking Face
  Generation
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
Jun Ling
Xuejiao Tan
Liyang Chen
Runnan Li
Yuchao Zhang
Sheng Zhao
Liang Song
CVBM
44
13
0
29 Aug 2022
Video Manipulations Beyond Faces: A Dataset with Human-Machine Analysis
Video Manipulations Beyond Faces: A Dataset with Human-Machine Analysis
Trisha Mittal
Ritwik Sinha
Viswanathan Swaminathan
John Collomosse
Dinesh Manocha
24
9
0
26 Jul 2022
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head
  Synthesis
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis
Shuai Shen
Wanhua Li
Zhengbiao Zhu
Yueqi Duan
Jie Zhou
Jiwen Lu
CVBM
25
105
0
24 Jul 2022
Perceptual Conversational Head Generation with Regularized Driver and
  Enhanced Renderer
Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer
Ai-Mei Huang
Zhewei Huang
Shuchang Zhou
VGen
32
7
0
26 Jun 2022
Deep Learning for Visual Speech Analysis: A Survey
Deep Learning for Visual Speech Analysis: A Survey
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
VLM
23
33
0
22 May 2022
A comprehensive survey on semantic facial attribute editing using
  generative adversarial networks
A comprehensive survey on semantic facial attribute editing using generative adversarial networks
A. Nickabadi
Maryam Saeedi Fard
Nastaran Moradzadeh Farid
Najmeh Mohammadbagheri
CVBM
GAN
EGVM
38
9
0
21 May 2022
Sound-Guided Semantic Video Generation
Sound-Guided Semantic Video Generation
Seung Hyun Lee
Gyeongrok Oh
Wonmin Byeon
Chanyoung Kim
Wonjae Ryoo
Sang Ho Yoon
Hyunjun Cho
Jihyun Bae
Jinkyu Kim
Sangpil Kim
VGen
18
24
0
20 Apr 2022
Audio-Visual Person-of-Interest DeepFake Detection
Audio-Visual Person-of-Interest DeepFake Detection
D. Cozzolino
Alessandro Pianese
Matthias Nießner
L. Verdoliva
28
60
0
06 Apr 2022
Residual-guided Personalized Speech Synthesis based on Face Image
Residual-guided Personalized Speech Synthesis based on Face Image
Jianrong Wang
Zixuan Wang
Xiaosheng Hu
Xuewei Li
Qiang Fang
Li Liu
CVBM
19
16
0
01 Apr 2022
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via
  Pre-trained StyleGAN
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
Fei Yin
Yong Zhang
Xiaodong Cun
Ming Cao
Yanbo Fan
Xuanxia Wang
Qingyan Bai
Baoyuan Wu
Jue Wang
Yujiu Yang
CVBM
34
171
0
08 Mar 2022
Freeform Body Motion Generation from Speech
Freeform Body Motion Generation from Speech
Jing-Fen Xu
Wei Zhang
Yalong Bai
Qi-Biao Sun
Tao Mei
SLR
31
18
0
04 Mar 2022
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
Xian Liu
Yinghao Xu
Qianyi Wu
Hang Zhou
Wayne Wu
Bolei Zhou
VGen
DiffM
3DH
37
140
0
19 Jan 2022
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face
  Attributes Neural Rendering
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering
Shunyu Yao
Ruizhe Zhong
Yichao Yan
Guangtao Zhai
Xiaokang Yang
CVBM
19
90
0
03 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
24
48
0
27 Dec 2021
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Yingruo Fan
Zhaojiang Lin
Jun Saito
Wenping Wang
Taku Komura
CVBM
43
195
0
10 Dec 2021
One-shot Talking Face Generation from Single-speaker Audio-Visual
  Correlation Learning
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning
Suzhe Wang
Lincheng Li
Yueqing Ding
Xin Yu
CVBM
64
117
0
06 Dec 2021
Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face
  Synthesis
Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face Synthesis
Haozhe Wu
Jia Jia
Haoyu Wang
Yishun Dou
Chao Duan
Qingshan Deng
CVBM
9
73
0
30 Oct 2021
Deep Person Generation: A Survey from the Perspective of Face, Pose and
  Cloth Synthesis
Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis
Tong Sha
Wei Zhang
T. Shen
Zhoujun Li
Tao Mei
29
38
0
05 Sep 2021
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary
  Person
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person
Xinsheng Wang
Qicong Xie
Jihua Zhu
Lei Xie
O. Scharenborg
28
16
0
09 Aug 2021
KoDF: A Large-scale Korean DeepFake Detection Dataset
KoDF: A Large-scale Korean DeepFake Detection Dataset
Patrick Kwon
J. You
Gyuhyeon Nam
Sungwoo Park
Gyeongsu Chae
21
99
0
18 Mar 2021
CNN with large memory layers
CNN with large memory layers
R. Karimov
Yury Malkov
Karim Iskakov
Victor Lempitsky
19
0
0
27 Jan 2021
Multi Modal Adaptive Normalization for Audio to Video Generation
Multi Modal Adaptive Normalization for Audio to Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
VGen
DiffM
27
0
0
14 Dec 2020
APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment
APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment
Jiangning Zhang
Xianfang Zeng
Chao Xu
Jun Chen
Yong Liu
Yunliang Jiang
CVBM
28
1
0
25 Oct 2020
Previous
12