A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

23 August 2020
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
    EGVM
arXiv: 2008.10010

Papers citing "A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild"

50 / 410 papers shown
Title
THQA: A Perceptual Quality Assessment Database for Talking Heads
Yingjie Zhou
Zicheng Zhang
Wei Sun
Xiaohong Liu
Xiongkuo Min
Zhihua Wang
Xiao-Ping Zhang
Guangtao Zhai
EGVM
32
10
0
13 Apr 2024
Translation-based Video-to-Video Synthesis
Pratim Saha
Chengcui Zhang
DiffM
26
1
0
03 Apr 2024
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
Xu He
Qiaochu Huang
Zhensong Zhang
Zhiwei Lin
Zhiyong Wu
Sicheng Yang
Minglei Li
Zhiyi Chen
Songcen Xu
Xiaofei Wu
35
15
0
02 Apr 2024
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
Shuai Tan
Bin Ji
Mengxiao Bi
Ye Pan
38
26
0
02 Apr 2024
Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation
Taekyung Ki
Dongchan Min
Gyeongsu Chae
3DH
38
5
0
31 Mar 2024
Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior
Jaehoon Ko
Kyusun Cho
Joungbin Lee
Heeji Yoon
Sangmin Lee
Sangjun Ahn
Seungryong Kim
34
2
0
29 Mar 2024
MI-NeRF: Learning a Single Face NeRF from Multiple Identities
Aggelina Chatziagapi
Grigorios G. Chrysos
Dimitris Samaras
CVBM
45
2
0
29 Mar 2024
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation
Seyeon Kim
Siyoon Jin
Jihye Park
Kihong Kim
Jiyoung Kim
Jisu Nam
Seungryong Kim
DiffM
VGen
66
3
0
28 Mar 2024
Superior and Pragmatic Talking Face Generation with Teacher-Student Framework
Chao Liang
Jianwen Jiang
Tianyun Zhong
Gaojie Lin
Zhengkun Rong
Jiaqi Yang
Yongming Zhu
45
1
0
26 Mar 2024
Deepfake Generation and Detection: A Benchmark and Survey
Gan Pei
Jiangning Zhang
Menghan Hu
Zhenyu Zhang
Chengjie Wang
Yunsheng Wu
Guangtao Zhai
Jian Yang
Chunhua Shen
Dacheng Tao
52
25
0
26 Mar 2024
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
Ziyao Huang
Fan Tang
Yong Zhang
Xiaodong Cun
Juan Cao
Jintao Li
Tong-Yee Lee
DiffM
VGen
45
16
0
25 Mar 2024
DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
DiffM
45
7
0
25 Mar 2024
Dyadic Interaction Modeling for Social Behavior Generation
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
40
7
0
14 Mar 2024
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Enric Corona
Andrei Zanfir
Eduard Gabriel Bazavan
Nikos Kolotouros
Thiemo Alldieck
C. Sminchisescu
VGen
DiffM
43
26
0
13 Mar 2024
A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos
Weixia Zhang
Chengguang Zhu
Jingnan Gao
Yichao Yan
Guangtao Zhai
Xiaokang Yang
EGVM
54
2
0
11 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
Shuai Tan
Bin Ji
Ye Pan
42
15
0
11 Mar 2024
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style
Shuai Tan
Bin Ji
Ye Pan
43
19
0
11 Mar 2024
Say Anything with Any Style
Shuai Tan
Bin Ji
Yu Ding
Ye Pan
VGen
DiffM
29
10
0
11 Mar 2024
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Chao Xu
Yang Liu
Jiazheng Xing
Weida Wang
Mingze Sun
...
Tianxin Huang
Siyuan Li
Zhi-Qi Cheng
Ying Tai
Baigui Sun
CVBM
54
11
0
04 Mar 2024
G4G:A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment
Juan Zhang
Jiahao Chen
Cheng Wang
Zhi-Yang Yu
Tangquan Qi
Di Wu
CVBM
41
0
0
28 Feb 2024
Context-aware Talking Face Video Generation
Meidai Xuanyuan
Yuwang Wang
Honglei Guo
Qionghai Dai
DiffM
41
0
0
28 Feb 2024
EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Linrui Tian
Qi Wang
Bang Zhang
Liefeng Bo
DiffM
69
102
0
27 Feb 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Zicheng Zhang
Ruobing Zheng
Ziwen Liu
Congying Han
Tianqi Li
Meng Wang
Tiande Guo
Jingdong Chen
Bonan Li
Ming Yang
3DH
32
5
0
27 Feb 2024
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation
Yasheng Sun
Wenqing Chu
Hang Zhou
Kaisiyuan Wang
Hideki Koike
37
5
0
25 Feb 2024
EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation
Guanwen Feng
Haoran Cheng
Yunan Li
Zhiyuan Ma
Chaoneng Li
Zhihao Qian
Qiguang Miao
Chi-Man Pun
CVBM
31
2
0
02 Feb 2024
Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes
Weifeng Liu
Tianyi She
Jiawei Liu
Run Wang
Dongyu Yao
Ziyou Liang
40
5
0
28 Jan 2024
NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis
Chongke Bi
Xiaoxing Liu
Zhilei Liu
DiffM
CVBM
29
4
0
23 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
35
3
0
21 Jan 2024
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
Soumyya Kanti Datta
Shan Jia
Siwei Lyu
46
6
0
18 Jan 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
Zhenhui Ye
Tianyun Zhong
Yi Ren
Jiaqi Yang
Weichuang Li
...
Jinglin Liu
Chen Zhang
Xiang Yin
Zejun Ma
Zhou Zhao
32
45
0
16 Jan 2024
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model
Bingyuan Zhang
Xulong Zhang
Ning Cheng
Jun Yu
Jing Xiao
Jianzong Wang
DiffM
31
5
0
16 Jan 2024
Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors
Jack D. Saunders
Vinay P. Namboodiri
VGen
DiffM
31
1
0
11 Jan 2024
Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation
Renshuai Liu
Bowen Ma
Wei Zhang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Xuan Cheng
DiffM
27
20
0
02 Jan 2024
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation
Xize Cheng
Rongjie Huang
Linjun Li
Tao Jin
Zehan Wang
Aoxiong Yin
Minglei Li
Xinyu Duan
Changpeng Yang
Zhou Zhao
33
2
0
23 Dec 2023
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis
Dongze Li
Kang Zhao
Wei Wang
Bo Peng
Yingya Zhang
Jing Dong
Tien-Ping Tan
DiffM
VGen
29
12
0
18 Dec 2023
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
47
2
0
15 Dec 2023
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Shivangi Aneja
Justus Thies
Angela Dai
Matthias Nießner
DiffM
VGen
40
29
0
13 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
50
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
48
1
0
12 Dec 2023
Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism
Georgios Milis
P. Filntisis
A. Roussos
Petros Maragos
CVBM
34
2
0
11 Dec 2023
DiT-Head: High-Resolution Talking Head Synthesis using Diffusion Transformers
Aaron Mir
Eduardo Alonso
Esther Mondragón
DiffM
38
2
0
11 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
32
0
0
09 Dec 2023
FT2TF: First-Person Statement Text-To-Talking Face Generation
Xingjian Diao
Ming Cheng
Wayner Barrios
SouYoung Jin
43
11
0
09 Dec 2023
SingingHead: A Large-scale 4D Dataset for Singing Head Animation
Sijing Wu
Yunhao Li
Weitian Zhang
Jun Jia
Yucheng Zhu
Yichao Yan
Guangtao Zhai
Xiaokang Yang
46
2
0
07 Dec 2023
PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features
Tianshun Han
Shengnan Gui
Yiqing Huang
Baihui Li
Lijian Liu
...
Quan Lu
Ruicong Zhi
Yanyan Liang
Du Zhang
Jun Wan
VGen
46
1
0
05 Dec 2023
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
J. Choi
Se Jin Park
Minsu Kim
Y. Ro
33
12
0
05 Dec 2023
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Xusen Sun
Longhao Zhang
Hao Zhu
Peng Zhang
Bang Zhang
Xinya Ji
Kangneng Zhou
Daiheng Gao
Liefeng Bo
Xun Cao
VGen
33
24
0
04 Dec 2023
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes
Pavel Korshunov
Haolin Chen
Philip N. Garner
Sébastien Marcel
CVBM
48
4
0
29 Nov 2023
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
Ziqiao Peng
Wentao Hu
Yue Shi
Xiangyu Zhu
Xiaomei Zhang
Hao Zhao
Jun He
Hongyan Liu
Zhaoxin Fan
41
39
0
29 Nov 2023
AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents
Duomin Wang
Bin Dai
Yu Deng
Baoyuan Wang
VGen
45
5
0
29 Nov 2023