Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.09293
Cited By
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion
20 July 2021
Suzhe Wang
Lincheng Li
Yu-qiong Ding
Changjie Fan
Xin Yu
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion"
50 / 106 papers shown
Title
Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
Weipeng Tan
Chuming Lin
Chengming Xu
F. Xu
Xiaobin Hu
Xiaozhong Ji
Junwei Zhu
Chengjie Wang
Yanwei Fu
49
0
0
25 Apr 2025
Exploiting Temporal Audio-Visual Correlation Embedding for Audio-Driven One-Shot Talking Head Animation
Zhihua Xu
Tianshui Chen
Zhijing Yang
Siyuan Peng
Keze Wang
Liang Lin
26
0
0
08 Apr 2025
DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model
Kangwei Liu
Junwu Liu
Yun Cao
Jinlin Guo
Xiaowei Yi
DiffM
41
0
0
24 Mar 2025
Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation
Baptiste Chopin
Tashvik Dhamija
P. Balaji
Yaohui Wang
A. Dantcheva
DiffM
VGen
49
0
0
24 Feb 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Rebecca Mobbs
Dimitrios Makris
Vasileios Argyriou
43
0
0
02 Feb 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DH
MDE
48
0
0
15 Jan 2025
Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization
Bin Lin
Yanzhen Yu
Jianhao Ye
Ruitao Lv
Yuqing Yang
Ruoye Xie
Pan Yu
Hongbin Zhou
VGen
35
1
0
18 Oct 2024
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
Hanbo Cheng
Limin Lin
Chenyu Liu
Pengcheng Xia
Pengfei Hu
Jiefeng Ma
Jun Du
Jia Pan
DiffM
VGen
136
0
0
17 Oct 2024
Separation of Neural Drives to Muscles from Transferred Polyfunctional Nerves using Implanted Micro-electrode Arrays
Laura Ferrante
Anna Boesendorfer
D. Barsakcioglu
Benedikt Baumgartner
Yazan Al-Ajam
Alex Woollard
Norbert Venantius Kang
Oskar Aszmann
D. Farina
38
0
0
14 Oct 2024
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Jiahao Cui
Hui Li
Yao Yao
Hao Zhu
Hanlin Shang
Kaihui Cheng
Hang Zhou
Siyu Zhu
Jingdong Wang
DiffM
VGen
43
22
0
10 Oct 2024
Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Jingyi Xu
Hieu Le
Zhixin Shu
Yang Wang
Yi-Hsuan Tsai
Dimitris Samaras
34
0
0
29 Sep 2024
DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Fa-Ting Hong
Yunfei Liu
Yu Li
Changyin Zhou
Fei Yu
D. Xu
DiffM
35
0
0
16 Sep 2024
LawDNet: Enhanced Audio-Driven Lip Synthesis via Local Affine Warping Deformation
Deng Junli
Luo Yihao
Yang Xueting
Li Siyou
Wang Wei
Guo Jinyang
Shi Ping
26
0
0
14 Sep 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
46
9
0
14 Sep 2024
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model
Weipeng Tan
Chuming Lin
Chengming Xu
Xiaozhong Ji
Junwei Zhu
Chengjie Wang
Yanwei Fu
DiffM
41
0
0
05 Sep 2024
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model
Ziyu Yao
Xuxin Cheng
Zhiqi Huang
DiffM
19
3
0
18 Aug 2024
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Jiazhi Guan
Zhiliang Xu
Hang Zhou
Kaisiyuan Wang
Shengyi He
...
Errui Ding
Jingtuo Liu
Jingdong Wang
Youjian Zhao
Ziwei Liu
VGen
51
2
0
06 Aug 2024
EmoFace: Audio-driven Emotional 3D Face Animation
Chang Liu
Qunfen Lin
Zijiao Zeng
Ye Pan
CVBM
44
4
0
17 Jul 2024
RITA: A Real-time Interactive Talking Avatars Framework
Wuxinlin Cheng
Cheng Wan
Yupeng Cao
Sihan Chen
37
0
0
18 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
42
1
0
15 Jun 2024
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Mingwang Xu
Hui Li
Qingkun Su
Hanlin Shang
Liwei Zhang
Ce Liu
Jingdong Wang
Yao Yao
Siyu Zhu
VGen
34
68
0
13 Jun 2024
Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation
Jiadong Liang
Feng Lu
CVBM
34
0
0
12 Jun 2024
Controllable Talking Face Generation by Implicit Facial Keypoints Editing
Dong Zhao
Jiaying Shi
Wenjun Li
Shudong Wang
Shenghui Xu
Zhaoming Pan
CVBM
46
0
0
05 Jun 2024
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
Yuchi Wang
Junliang Guo
Jianhong Bai
Runyi Yu
Tianyu He
Xu Tan
Xu Sun
Jiang Bian
DiffM
35
9
0
24 May 2024
OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance
Shuheng Ge
Haoyu Xing
Li Zhang
Xiangqian Wu
39
0
0
23 May 2024
Generative Artificial Intelligence: A Systematic Review and Applications
S. S. Sengar
Affan Bin Hasan
Sanjay Kumar
Fiona Carroll
MedIm
30
51
0
17 May 2024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Youngjoon Jang
Ji-Hoon Kim
Junseok Ahn
Doyeop Kwak
Hong-Sun Yang
Yooncheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
CVBM
31
9
0
16 May 2024
NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior
Gihoon Kim
Kwanggyoon Seo
Sihun Cha
Junyong Noh
DiffM
3DH
39
1
0
09 May 2024
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
Tao Liu
Feilong Chen
Shuai Fan
Chenpeng Du
Qi Chen
Xie Chen
Kai Yu
DiffM
PINN
36
25
0
06 May 2024
Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection
Cai Yu
Shan Jia
Xiaomeng Fu
Jin Liu
Jiahe Tian
Jiao Dai
Xi Wang
Siwei Lyu
Jizhong Han
39
5
0
30 Apr 2024
Embedded Representation Learning Network for Animating Styled Video Portrait
Tianyong Wang
Xiangyu Liang
Wangguandong Zheng
Dan Niu
Haifeng Xia
Siyu Xia
3DH
29
0
0
29 Apr 2024
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Sicheng Xu
Guojun Chen
Yu-Xiao Guo
Jiaolong Yang
Chong Li
Zhenyu Zang
Yizhong Zhang
Xin Tong
Baining Guo
45
87
0
16 Apr 2024
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
Andre Rochow
Max Schwarz
Sven Behnke
ViT
48
6
0
15 Apr 2024
THQA: A Perceptual Quality Assessment Database for Talking Heads
Yingjie Zhou
Zicheng Zhang
Wei Sun
Xiaohong Liu
Xiongkuo Min
Zhihua Wang
Xiao-Ping Zhang
Guangtao Zhai
EGVM
32
10
0
13 Apr 2024
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
Shuai Tan
Bin Ji
Mengxiao Bi
Ye Pan
38
26
0
02 Apr 2024
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation
Seyeon Kim
Siyoon Jin
Jihye Park
Kihong Kim
Jiyoung Kim
Jisu Nam
Seungryong Kim
DiffM
VGen
60
3
0
28 Mar 2024
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Enric Corona
Andrei Zanfir
Eduard Gabriel Bazavan
Nikos Kolotouros
Thiemo Alldieck
C. Sminchisescu
VGen
DiffM
43
26
0
13 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
Shuai Tan
Bin Ji
Ye Pan
42
15
0
11 Mar 2024
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style
Shuai Tan
Bin Ji
Ye Pan
34
19
0
11 Mar 2024
Say Anything with Any Style
Shuai Tan
Bin Ji
Yu Ding
Ye Pan
VGen
DiffM
29
10
0
11 Mar 2024
EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation
Guanwen Feng
Haoran Cheng
Yunan Li
Zhiyuan Ma
Chaoneng Li
Zhihao Qian
Qiguang Miao
Chi-Man Pun
CVBM
31
2
0
02 Feb 2024
VectorTalker: SVG Talking Face Generation with Progressive Vectorisation
Hao Hu
Xuan Wang
Jingxiang Sun
Yanbo Fan
Yu-Xiao Guo
Caigui Jiang
17
1
0
18 Dec 2023
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
44
2
0
15 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
47
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
42
1
0
12 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
29
0
0
09 Dec 2023
SingingHead: A Large-scale 4D Dataset for Singing Head Animation
Sijing Wu
Yunhao Li
Weitian Zhang
Jun Jia
Yucheng Zhu
Yichao Yan
Guangtao Zhai
Xiaokang Yang
41
2
0
07 Dec 2023
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
Balamurugan Thambiraja
S. Aliakbarian
Darren Cosker
Justus Thies
DiffM
VGen
45
11
0
01 Dec 2023
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
Ziqiao Peng
Wentao Hu
Yue Shi
Xiangyu Zhu
Xiaomei Zhang
Hao Zhao
Jun He
Hongyan Liu
Zhaoxin Fan
38
39
0
29 Nov 2023
GAIA: Zero-shot Talking Avatar Generation
Tianyu He
Junliang Guo
Runyi Yu
Yuchi Wang
Jialiang Zhu
...
Chunyu Wang
Han Hu
HsiangTao Wu
Sheng Zhao
Jiang Bian
31
25
0
26 Nov 2023
1
2
3
Next