ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.10137
  4. Cited By
Audio-driven Talking Face Video Generation with Learning-based
  Personalized Head Pose

Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose

24 February 2020
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong-jin Liu
    CVBM
ArXivPDFHTML

Papers citing "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose"

50 / 83 papers shown
Title
Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies
Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies
Soumyya Kanti Datta
Shan Jia
Siwei Lyu
44
0
0
02 Apr 2025
Personalized Generation In Large Model Era: A Survey
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Wenqiang Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
79
2
0
04 Mar 2025
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
Jiahe Li
Jiawei Zhang
Xiao Bai
Jin Zheng
J. Zhou
L. Gu
62
0
0
27 Feb 2025
Driving Towards Inclusion: A Systematic Review of AI-powered Accessibility Enhancements for People with Disability in Autonomous Vehicles
Driving Towards Inclusion: A Systematic Review of AI-powered Accessibility Enhancements for People with Disability in Autonomous Vehicles
Ashish Bastola
Julian Brinkley
Hao Wang
Abolfazl Razi
A. Moshayedi
Abolfazl Razi
48
0
0
10 Jan 2025
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based
  Audio-Driven Facial Dynamics and Head Motion Generation
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
Xuyang Cao
Guoxin Wang
Sheng Shi
Jun Zhao
Yang Yao
Jintao Fei
Minyu Gao
VGen
37
1
0
14 Nov 2024
MimicTalk: Mimicking a personalized and expressive 3D talking face in
  minutes
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Zhenhui Ye
Tianyun Zhong
Yi Ren
Ziyue Jiang
Jiawei Huang
...
Chen Zhang
Zehan Wang
Xize Chen
Xiang Yin
Zhou Zhao
VGen
36
3
0
09 Oct 2024
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
Lam Pham
Phat Lam
Dat Tran
Hieu Tang
Tin Nguyen
Alexander Schindler
Canh Vu
Alexander Polonsky
Canh Vu
53
3
0
23 Sep 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of
  Talking Heads
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
43
9
0
14 Sep 2024
S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High
  Fidelity Talking Head Synthesis
S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis
Dongze Li
Kang Zhao
Wei Wang
Yifeng Ma
Bo Peng
Yingya Zhang
Jing Dong
3DH
CVBM
35
2
0
18 Aug 2024
Content and Style Aware Audio-Driven Facial Animation
Content and Style Aware Audio-Driven Facial Animation
Qingju Liu
Hyeongwoo Kim
Gaurav Bharaj
DiffM
43
1
0
13 Aug 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis:
  Techniques for Portrait Generation, Driving Mechanisms, and Editing
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
39
1
0
15 Jun 2024
AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Trevine Oorloff
Surya Koppisetti
Nicolò Bonettini
Divyaraj Solanki
Ben Colman
Yaser Yacoob
Ali Shahriyari
Gaurav Bharaj
32
21
0
05 Jun 2024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Youngjoon Jang
Ji-Hoon Kim
Junseok Ahn
Doyeop Kwak
Hong-Sun Yang
Yooncheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
CVBM
29
9
0
16 May 2024
Dyadic Interaction Modeling for Social Behavior Generation
Dyadic Interaction Modeling for Social Behavior Generation
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
37
6
0
14 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through
  Normalizing Flow and Quantization
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
Shuai Tan
Bin Ji
Ye Pan
42
15
0
11 Mar 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Zicheng Zhang
Ruobing Zheng
Ziwen Liu
Congying Han
Tianqi Li
Meng Wang
Tiande Guo
Jingdong Chen
Bonan Li
Ming Yang
3DH
32
5
0
27 Feb 2024
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
Soumyya Kanti Datta
Shan Jia
Siwei Lyu
36
6
0
18 Jan 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
Zhenhui Ye
Tianyun Zhong
Yi Ren
Jiaqi Yang
Weichuang Li
...
Jinglin Liu
Chen Zhang
Xiang Yin
Zejun Ma
Zhou Zhao
29
45
0
16 Jan 2024
DreamTalk: When Expressive Talking Head Generation Meets Diffusion
  Probabilistic Models
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
41
2
0
15 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained
  3D Face Guidance
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
47
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
42
1
0
12 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid
  Landmarks Encoding and Progressive Multilayer Conditioning
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
24
0
0
09 Dec 2023
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
Balamurugan Thambiraja
S. Aliakbarian
Darren Cosker
Justus Thies
DiffM
VGen
45
11
0
01 Dec 2023
THInImg: Cross-modal Steganography for Presenting Talking Heads in
  Images
THInImg: Cross-modal Steganography for Presenting Talking Heads in Images
Lin Zhao
Hongxuan Li
Xuefei Ning
Xinru Jiang
27
1
0
28 Nov 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous
  Head Motions
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
21
3
0
28 Sep 2023
ReliTalk: Relightable Talking Portrait Generation from a Single Video
ReliTalk: Relightable Talking Portrait Generation from a Single Video
Haonan Qiu
Zhaoxi Chen
Yuming Jiang
Hang Zhou
Xiangyu Fan
Lei Yang
Wayne Wu
Ziwei Liu
DiffM
VGen
24
10
0
05 Sep 2023
RADIO: Reference-Agnostic Dubbing Video Synthesis
RADIO: Reference-Agnostic Dubbing Video Synthesis
Dongyeun Lee
Chaewon Kim
Sangjoon Yu
Jaejun Yoo
Gyeong-Moon Park
VGen
DiffM
34
1
0
05 Sep 2023
Audio-Driven Dubbing for User Generated Contents via Style-Aware
  Semi-Parametric Synthesis
Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
Linsen Song
Wayne Wu
Chaoyou Fu
Chen Change Loy
Ran He
25
10
0
31 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and
  Generation
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
27
5
0
17 Aug 2023
Speech-Driven 3D Face Animation with Composite and Regional Facial
  Movements
Speech-Driven 3D Face Animation with Composite and Regional Facial Movements
Haozhe Wu
Songtao Zhou
Jia Jia
Junliang Xing
Qi Wen
Xiang Wen
CVBM
32
15
0
10 Aug 2023
UniBriVL: Robust Universal Representation and Generation of Audio Driven
  Diffusion Models
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang
Bowen Gao
Yangjian Wu
T. Teoh
DiffM
31
1
0
29 Jul 2023
MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions
MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions
Yunfei Liu
Lijian Lin
Fei Yu
Changyin Zhou
Yu Li
DiffM
VGen
42
23
0
19 Jul 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony
  in Talking Head Generation
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
Louis Airale
Dominique Vaufreydaz
Xavier Alameda-Pineda
23
1
0
04 Jul 2023
Parametric Implicit Face Representation for Audio-Driven Facial
  Reenactment
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment
Ricong Huang
Puxiang Lai
Yipeng Qin
Guanbin Li
CVBM
DiffM
25
13
0
13 Jun 2023
IFaceUV: Intuitive Motion Facial Image Generation by Identity
  Preservation via UV map
IFaceUV: Intuitive Motion Facial Image Generation by Identity Preservation via UV map
Han-Lim Lee
Yu-Te Ku
Eunseok Kim
Seungryul Baek
3DH
28
0
0
08 Jun 2023
LPMM: Intuitive Pose Control for Neural Talking-Head Model via
  Landmark-Parameter Morphable Model
LPMM: Intuitive Pose Control for Neural Talking-Head Model via Landmark-Parameter Morphable Model
K. Lee
Patrick Kwon
Myung Ki Lee
Namhyuk Ahn
Junsoo Lee
9
1
0
17 May 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in
  Style-based Generator
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Jiazhi Guan
Zhanwang Zhang
Hang Zhou
Tianshu Hu
Kaisiyuan Wang
...
Haocheng Feng
Jingtuo Liu
Errui Ding
Ziwei Liu
Jingdong Wang
37
57
0
09 May 2023
High-fidelity Generalized Emotional Talking Face Generation with
  Multi-modal Emotion Space Learning
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning
Chao Xu
Sijun Tan
Jibang Wu
Yue Han
Wenqing Chu
Xiaohui Bei
Chengjie Wang
Haifeng Xu
Yong Liu
CVBM
48
36
0
04 May 2023
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking
  Face Generation
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Zhenhui Ye
Jinzheng He
Ziyue Jiang
Rongjie Huang
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Xiang Yin
Zejun Ma
Zhou Zhao
CVBM
49
29
0
01 May 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial
  Animations
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
24
9
0
18 Apr 2023
That's What I Said: Fully-Controllable Talking Face Generation
That's What I Said: Fully-Controllable Talking Face Generation
Youngjoon Jang
Kyeongha Rho
Jong-Bin Woo
Hyeongkeun Lee
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Joon Son Chung
CVBM
19
9
0
06 Apr 2023
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with
  Diffusion Autoencoder
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
Chenpeng Du
Qi Chen
Xie Chen
K. Yu
DiffM
27
50
0
30 Mar 2023
MusicFace: Music-driven Expressive Singing Face Synthesis
MusicFace: Music-driven Expressive Singing Face Synthesis
Peng Liu
W. Deng
Hengda Li
Jintai Wang
Yinglin Zheng
Yiwei Ding
Xiaohu Guo
Ming Zeng
CVBM
35
10
0
24 Mar 2023
Exploring Efficient-Tuned Learning Audio Representation Method from
  BriVL
Exploring Efficient-Tuned Learning Audio Representation Method from BriVL
Sen Fang
Yang Wu
Bowen Gao
Jingwen Cai
T. Teoh
DiffM
18
1
0
08 Mar 2023
Pose-Controllable 3D Facial Animation Synthesis using Hierarchical
  Audio-Vertex Attention
Pose-Controllable 3D Facial Animation Synthesis using Hierarchical Audio-Vertex Attention
Bin Liu
Xiaolin K. Wei
Bo Li
Junjie Cao
Yunyu Lai
CVBM
19
1
0
24 Feb 2023
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face
  Synthesis
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis
Zhenhui Ye
Ziyue Jiang
Yi Ren
Jinglin Liu
Jinzheng He
Zhou Zhao
CVBM
20
122
0
31 Jan 2023
Regeneration Learning: A Learning Paradigm for Data Generation
Regeneration Learning: A Learning Paradigm for Data Generation
Xu Tan
Tao Qin
Jiang Bian
Tie-Yan Liu
Yoshua Bengio
GAN
38
15
0
21 Jan 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Ziȩba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffM
VGen
24
34
0
10 Jan 2023
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Jinbo Xing
Menghan Xia
Yuechen Zhang
Xiaodong Cun
Jue Wang
T. Wong
24
141
0
06 Jan 2023
Expressive Speech-driven Facial Animation with controllable emotions
Expressive Speech-driven Facial Animation with controllable emotions
Yutong Chen
Junhong Zhao
Weiqiang Zhang
25
8
0
05 Jan 2023
12
Next