Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

20 July 2021
Suzhe Wang
Lincheng Li
Yu-qiong Ding
Changjie Fan
Xin Yu
    VGen

Papers citing "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion"

50 / 106 papers shown
META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation
Mireille Fares
Catherine Pelachaud
Nicolas Obin
18
1
0
09 Nov 2023
AdaMesh: Personalized Facial Expressions and Head Poses for Adaptive Speech-Driven 3D Facial Animation
Liyang Chen
Weihong Bao
Shunwei Lei
Boshi Tang
Zhiyong Wu
Shiyin Kang
Haozhi Huang
Helen M. Meng
42
1
0
11 Oct 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
21
3
0
28 Sep 2023
Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Xiuzhe Wu
Pengfei Hu
Yang Wu
Xiaoyang Lyu
Yan-Pei Cao
Ying Shan
Wenming Yang
Zhongqian Sun
Xiaojuan Qi
23
14
0
09 Sep 2023
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications
Shreyank N. Gowda
Dheeraj Pandey
Shashank Narayana Gowda
49
3
0
30 Aug 2023
AdVerb: Visually Guided Audio Dereverberation
Sanjoy Chowdhury
Sreyan Ghosh
Subhrajyoti Dasgupta
Anton Ratnarajah
Utkarsh Tyagi
Tianyi Zhou
27
11
0
23 Aug 2023
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
Se Jin Park
Joanna Hong
Minsu Kim
Y. Ro
29
4
0
23 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
27
5
0
17 Aug 2023
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation
Zhichao Wang
M. Dai
Keld Lundgaard
VGen
DiffM
43
2
0
12 Aug 2023
Controlling Character Motions without Observable Driving Source
Weiyuan Li
Bin Dai
Ziyi Zhou
Qi Yao
Baoyuan Wang
VGen
8
1
0
11 Aug 2023
Learning and Evaluating Human Preferences for Conversational Head Generation
Mohan Zhou
Yalong Bai
Wei Zhang
Ting Yao
Tiejun Zhao
Tao Mei
32
2
0
20 Jul 2023
MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions
Yunfei Liu
Lijian Lin
Fei Yu
Changyin Zhou
Yu Li
DiffM
VGen
42
23
0
19 Jul 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
Louis Airale
Dominique Vaufreydaz
Xavier Alameda-Pineda
23
1
0
04 Jul 2023
SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces
Ziqiao Peng
Yihao Luo
Yue Shi
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
55
40
0
19 Jun 2023
Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks
Jianrong Wang
Yaxin Zhao
Li Liu
Tian-Shun Xu
Qi Li
Sen Li
16
9
0
06 Jun 2023
Exploring Phonetic Context-Aware Lip-Sync For Talking Face Generation
Se Jin Park
Minsu Kim
J. Choi
Y. Ro
CVBM
27
4
0
31 May 2023
DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
Fa-Ting Hong
Li Shen
Dan Xu
3DH
CVBM
21
15
0
10 May 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Jiazhi Guan
Zhanwang Zhang
Hang Zhou
Tianshu Hu
Kaisiyuan Wang
...
Haocheng Feng
Jingtuo Liu
Errui Ding
Ziwei Liu
Jingdong Wang
42
57
0
09 May 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
24
9
0
18 Apr 2023
That's What I Said: Fully-Controllable Talking Face Generation
Youngjoon Jang
Kyeongha Rho
Jong-Bin Woo
Hyeongkeun Lee
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Joon Son Chung
CVBM
19
9
0
06 Apr 2023
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles
Yifeng Ma
Suzhe Wang
Yu-qiong Ding
Lincheng Li
Bowen Ma
Tangjie Lv
Changjie Fan
Zhipeng Hu
Zhidong Deng
Xin Yu
CLIP
31
21
0
01 Apr 2023
FONT: Flow-guided One-shot Talking Head Generation with Natural Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
24
6
0
31 Mar 2023
MusicFace: Music-driven Expressive Singing Face Synthesis
Peng Liu
W. Deng
Hengda Li
Jintai Wang
Yinglin Zheng
Yiwei Ding
Xiaohu Guo
Ming Zeng
CVBM
35
10
0
24 Mar 2023
Emotionally Enhanced Talking Face Generation
Sahil Goyal
Shagun Uppal
Sarthak Bhagat
Yi Yu
Yifang Yin
R. Shah
CVBM
28
14
0
21 Mar 2023
Style Transfer for 2D Talking Head Animation
Trong-Thang Pham
Nhat Le
Tuong Khanh Long Do
Hung Nguyen
Erman Tjiputra
Quang-Dieu Tran
A. Nguyen
22
3
0
17 Mar 2023
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions
Geumbyeol Hwang
Sunwon Hong
Seunghyun Lee
Sungwoo Park
Gyeongsu Chae
VGen
32
5
0
14 Mar 2023
DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution Video
Zhimeng Zhang
Zhipeng Hu
W. Deng
Changjie Fan
Tangjie Lv
Yu-qiong Ding
3DH
CVBM
38
59
0
07 Mar 2023
OPT: One-shot Pose-Controllable Talking Head Generation
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
17
5
0
16 Feb 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Zięba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffM
VGen
24
34
0
10 Jan 2023
StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
Yifeng Ma
Suzhe Wang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Zhidong Deng
Xin Yu
56
82
0
03 Jan 2023
Imitator: Personalized Speech-driven 3D Facial Animation
Balamurugan Thambiraja
I. Habibie
S. Aliakbarian
Darren Cosker
Christian Theobalt
Justus Thies
CVBM
41
49
0
30 Dec 2022
Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Anni Tang
Tianyu He
Xuejiao Tan
Jun Ling
Liang Song
CVBM
26
23
0
09 Dec 2022
Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
Yasheng Sun
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Zhibin Hong
Jingtuo Liu
Errui Ding
Jingdong Wang
Ziwei Liu
Koike Hideki
35
34
0
09 Dec 2022
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors
Zhentao Yu
Zixin Yin
Deyu Zhou
Duomin Wang
Finn Wong
Baoyuan Wang
DiffM
30
35
0
07 Dec 2022
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
K. Cheng
Xiaodong Cun
Yong Zhang
Menghan Xia
Fei Yin
Mingrui Zhu
Xuanxia Wang
Jue Wang
Nan Wang
CVBM
25
92
0
27 Nov 2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Jiaxiang Tang
Kaisiyuan Wang
Hang Zhou
Xiaokang Chen
Dongliang He
Tianshu Hu
Jingtuo Liu
Gang Zeng
Jingdong Wang
3DH
34
76
0
22 Nov 2022
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Wenxuan Zhang
Xiaodong Cun
Xuan Wang
Yong Zhang
Xiaodong Shen
Yu-Xiao Guo
Ying Shan
Fei-Yue Wang
VGen
35
233
0
22 Nov 2022
SPACE: Speech-driven Portrait Animation with Controllable Expression
Francesco Ferroni
Arun Mallya
Ting-Chun Wang
Rafael Valle
Xuan Li
VGen
31
45
0
17 Nov 2022
Autoregressive GAN for Semantic Unconditional Head Motion Generation
Louis Airale
Xavier Alameda-Pineda
Stéphane Lathuilière
Dominique Vaufreydaz
25
3
0
02 Nov 2022
Facial Expression Video Generation Based-On Spatio-temporal Convolutional GAN: FEV-GAN
Hamza Bouzid
Lahoucine Ballihi
CVBM
25
9
0
20 Oct 2022
Compressing Video Calls using Synthetic Talking Heads
Madhav Agarwal
Anchit Gupta
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
17
10
0
07 Oct 2022
Audio-Visual Face Reenactment
Madhav Agarwal
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
VGen
24
22
0
06 Oct 2022
Talking Head from Speech Audio using a Pre-trained Image Generator
M. M. Alghamdi
He-Nan Wang
A. Bulpitt
David C. Hogg
70
21
0
09 Sep 2022
StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation
Dong Min
Min-Hwan Song
Eunji Ko
Sung Ju Hwang
VGen
35
12
0
23 Aug 2022
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis
Shuai Shen
Wanhua Li
Zhengbiao Zhu
Yueqi Duan
Jie Zhou
Jiwen Lu
CVBM
25
105
0
24 Jul 2022
3D Concept Grounding on Neural Fields
Yining Hong
Yilun Du
Chun-Tse Lin
J. Tenenbaum
Chuang Gan
23
19
0
13 Jul 2022
Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Alexander Waibel
M. Behr
Fevziye Irem Eyiokur
Dogucan Yaman
Tuan-Nam Nguyen
Carlos Mullov
Mehmet Arif Demirtas
Alperen Kantarci
Stefan Constantin
H. K. Ekenel
CVBM
15
14
0
09 Jun 2022
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model
Xinya Ji
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Wayne Wu
Feng Xu
Xun Cao
CVBM
54
157
0
30 May 2022
Deep Learning for Visual Speech Analysis: A Survey
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
VLM
26
33
0
22 May 2022
Emotion-Controllable Generalized Talking Face Generation
Sanjana Sinha
S. Biswas
Ravindra Yadav
Brojeshwar Bhowmick
CVBM
13
49
0
02 May 2022