Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.00418
Cited By
Towards Automatic Face-to-Face Translation
1 March 2020
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards Automatic Face-to-Face Translation"
41 / 91 papers shown
Title
All's well that FID's well? Result quality and metric scores in GAN models for lip-sychronization tasks
Carina Geldhauser
Johan Liljegren
Pontus Nordqvist
16
0
0
28 Dec 2022
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
K. Cheng
Xiaodong Cun
Yong Zhang
Menghan Xia
Fei Yin
Mingrui Zhu
Xuanxia Wang
Jue Wang
Nan Wang
CVBM
25
92
0
27 Nov 2022
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Wenxuan Zhang
Xiaodong Cun
Xuan Wang
Yong Zhang
Xiaodong Shen
Yu-Xiao Guo
Ying Shan
Fei Wang
VGen
35
233
0
22 Nov 2022
MARLIN: Masked Autoencoder for facial video Representation LearnINg
Zhixi Cai
Shreya Ghosh
Kalin Stefanov
Abhinav Dhall
Jianfei Cai
Hamid Rezatofighi
Reza Haffari
Munawar Hayat
ViT
CVBM
27
60
0
12 Nov 2022
A Survey of Deep Face Restoration: Denoise, Super-Resolution, Deblur, Artifact Removal
Tao Wang
Kaihao Zhang
Xuanxi Chen
Wenhan Luo
Jiankang Deng
Tong Lu
Xiaochun Cao
Wei Liu
Hongdong Li
S. Zafeiriou
SupR
37
36
0
05 Nov 2022
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Se Jin Park
Minsu Kim
Joanna Hong
J. Choi
Y. Ro
CVBM
30
85
0
02 Nov 2022
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages
Anusha Prakash
Arun Kumar
Ashish Seth
Bhagyashree Mukherjee
Ishika Gupta
...
D. Sharma
H. Murthy
P. Bhattacharya
S. Umesh
R. Sangal
40
4
0
01 Nov 2022
Compressing Video Calls using Synthetic Talking Heads
Madhav Agarwal
Anchit Gupta
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
17
10
0
07 Oct 2022
Audio-Visual Face Reenactment
Madhav Agarwal
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
VGen
27
22
0
06 Oct 2022
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Sindhu B. Hegde
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
45
10
0
01 Sep 2022
StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation
Dong Min
Min-Hwan Song
Eunji Ko
Sung Ju Hwang
VGen
38
12
0
23 Aug 2022
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Aditya Agarwal
Bipasha Sen
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
25
0
0
21 Aug 2022
FaceOff: A Video-to-Video Face Swapping System
Aditya Agarwal
Bipasha Sen
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
PICV
CVBM
45
2
0
21 Aug 2022
Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors
Sindhu B. Hegde
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
CVBM
16
1
0
17 Aug 2022
Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Alexander Waibel
M. Behr
Fevziye Irem Eyiokur
Dogucan Yaman
Tuan-Nam Nguyen
Carlos Mullov
Mehmet Arif Demirtas
Alperen Kantarci
Stefan Constantin
H. K. Ekenel
CVBM
15
14
0
09 Jun 2022
Deep Learning for Visual Speech Analysis: A Survey
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
VLM
29
33
0
22 May 2022
Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
Zhixi Cai
Kalin Stefanov
Abhinav Dhall
Munawar Hayat
20
3
0
13 Apr 2022
An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection
Gang Wang
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
Yanni Zhang
CVBM
21
5
0
10 Mar 2022
Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
Gang Wang
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
CVBM
24
14
0
08 Mar 2022
ASRPU: A Programmable Accelerator for Low-Power Automatic Speech Recognition
D. Pinto
J. Arnau
Antonio González
25
0
0
10 Feb 2022
Towards Realistic Visual Dubbing with Heterogeneous Sources
Tianyi Xie
Liucheng Liao
Cheng Bi
Benlai Tang
Xiang Yin
Jianfei Yang
Mingjie Wang
Jiali Yao
Yang Zhang
Zejun Ma
VGen
30
37
0
17 Jan 2022
KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics
Saida Mussakhojayeva
Yerbolat Khassanov
H. A. Varol
27
13
0
15 Jan 2022
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering
Shunyu Yao
Ruizhe Zhong
Yichao Yan
Guangtao Zhai
Xiaokang Yang
CVBM
24
90
0
03 Jan 2022
Audio-Visual Synchronisation in the wild
Honglie Chen
Weidi Xie
Triantafyllos Afouras
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
26
37
0
08 Dec 2021
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Michael Hassid
Michelle Tadmor Ramanovich
Brendan Shillingford
Miaosen Wang
Ye Jia
Tal Remez
DiffM
19
16
0
19 Nov 2021
Impact of Benign Modifications on Discriminative Performance of Deepfake Detectors
Yuhang Lu
Evgeniy Upenik
Touradj Ebrahimi
AAML
36
0
0
14 Nov 2021
Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor
Anchit Gupta
Faizan Farooq Khan
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
CVBM
24
6
0
16 Oct 2021
Parallel and High-Fidelity Text-to-Lip Generation
Jinglin Liu
Zhiying Zhu
Yi Ren
Wencan Huang
Baoxing Huai
N. Yuan
Zhou Zhao
32
10
0
14 Jul 2021
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization
A. Lahiri
Vivek Kwatra
C. Frueh
J. P. Lewis
C. Bregler
3DH
38
99
0
08 Jun 2021
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
Hang Zhou
Yasheng Sun
Wayne Wu
Chen Change Loy
Xiaogang Wang
Ziwei Liu
CVBM
28
360
0
22 Apr 2021
Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward
Momina Masood
M. Nawaz
K. Malik
A. Javed
Aun Irtaza
AAML
126
297
0
25 Feb 2021
Deepfake Video Detection Using Convolutional Vision Transformer
Deressa Wodajo
Solomon Atnafu
ViT
32
169
0
22 Feb 2021
AudioViewer: Learning to Visualize Sounds
Chunjin Song
Yuchi Zhang
Willis Peng
Parmis Mohaghegh
Bastian Wandt
Helge Rhodin
30
1
0
22 Dec 2020
Visual Speech Enhancement Without A Real Visual Stream
Sindhu B. Hegde
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
12
17
0
20 Dec 2020
Large-scale multilingual audio visual dubbing
Yi Yang
Brendan Shillingford
Yannis Assael
Miaosen Wang
Wendi Liu
...
Eren Sezener
Luis C. Cobo
Misha Denil
Y. Aytar
Nando de Freitas
22
20
0
06 Nov 2020
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
EGVM
52
757
0
23 Aug 2020
Revisiting Low Resource Status of Indian Languages in Machine Translation
Jerin Philip
Shashank Siripragada
Vinay P. Namboodiri
C. V. Jawahar
15
26
0
11 Aug 2020
Unsupervised Audiovisual Synthesis via Exemplar Autoencoders
Kangle Deng
Aayush Bansal
Deva Ramanan
SSL
VGen
31
12
0
13 Jan 2020
A Baseline Neural Machine Translation System for Indian Languages
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
60
17
0
29 Jul 2019
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,925
0
17 Aug 2015
Previous
1
2