Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.08580
Cited By
Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor
16 October 2021
Anchit Gupta
Faizan Farooq Khan
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
CVBM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor"
28 / 28 papers shown
Title
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Zejiang Shen
Ruochen Zhang
Melissa Dell
Benjamin Charles Germain Lee
Jacob Carlson
Weining Li
3DV
51
98
0
29 Mar 2021
Iterative Text-based Editing of Talking-heads Using Neural Retargeting
Xinwei Yao
Ohad Fried
Kayvon Fatahalian
Maneesh Agrawala
VGen
44
34
0
21 Nov 2020
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
EGVM
96
777
0
23 Aug 2020
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren
Chenxu Hu
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
105
1,396
0
08 Jun 2020
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Jaehyeon Kim
Sungwon Kim
Jungil Kong
Sungroh Yoon
81
491
0
22 May 2020
DeepFaceLab: Integrated, flexible and extensible face-swapping framework
Ivan Perov
Daiheng Gao
Nikolay Chervoniy
Kunlin Liu
Sugasa Marangonda
...
Jian Jiang
Sheng Zhang
Pingyu Wu
Wenbo Zhou
Weiming Zhang
CVBM
46
223
0
12 May 2020
MakeItTalk: Speaker-Aware Talking-Head Animation
Yang Zhou
Xintong Han
Eli Shechtman
J. Echevarria
E. Kalogerakis
Dingzeyu Li
63
421
0
27 Apr 2020
Towards Automatic Face-to-Face Translation
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
89
174
0
01 Mar 2020
First Order Motion Model for Image Animation
Aliaksandr Siarohin
Stéphane Lathuilière
Sergey Tulyakov
Elisa Ricci
N. Sebe
VGen
DiffM
77
925
0
29 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
406
42,393
0
03 Dec 2019
Neural Style-Preserving Visual Dubbing
Hyeongwoo Kim
Mohamed A. Elgharib
Michael Zollhöfer
Hans-Peter Seidel
Thabo Beeler
Christian Richardt
Christian Theobalt
VGen
47
94
0
05 Sep 2019
A Baseline Neural Machine Translation System for Indian Languages
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
86
17
0
29 Jul 2019
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis
Jeonghun Baek
Geewook Kim
Junyeop Lee
Sungrae Park
Dongyoon Han
Sangdoo Yun
Seong Joon Oh
Hwalsuk Lee
428
478
0
03 Apr 2019
CVIT-MT Systems for WAT-2018
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
22
10
0
19 Mar 2019
Animating Arbitrary Objects via Deep Motion Transfer
Aliaksandr Siarohin
Stéphane Lathuilière
Sergey Tulyakov
Elisa Ricci
N. Sebe
63
347
0
20 Dec 2018
Recycle-GAN: Unsupervised Video Retargeting
Aayush Bansal
Shugao Ma
Deva Ramanan
Yaser Sheikh
VGen
DiffM
73
297
0
15 Aug 2018
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
CVBM
87
441
0
20 Jul 2018
Synthesizing Images of Humans in Unseen Poses
Guha Balakrishnan
Amy Zhao
Adrian Dalca
F. Durand
John Guttag
GAN
3DH
48
314
0
20 Apr 2018
Talking Face Generation by Conditional Recurrent Adversarial Network
Yang Song
Jingwen Zhu
Dawei Li
Xiaolong Wang
Hairong Qi
CVBM
120
194
0
13 Apr 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
77
2,697
0
16 Dec 2017
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Ming-Yu Liu
Kainan Peng
Andrew Gibiansky
Sercan O. Arik
Ajay Kannan
Sharan Narang
Jonathan Raiman
John Miller
63
307
0
20 Oct 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
658
131,414
0
12 Jun 2017
Attention-based Extraction of Structured Information from Street View Imagery
Z. Wojna
Alexander N. Gorban
Dar-Shyang Lee
Kevin Patrick Murphy
Qian Yu
Yeqing Li
Julian Ibarz
42
153
0
11 Apr 2017
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
889
6,787
0
26 Sep 2016
Wav2Letter: an End-to-End ConvNet-based Speech Recognition System
R. Collobert
Christian Puhrsch
Gabriel Synnaeve
3DV
56
283
0
11 Sep 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
116
2,972
0
08 Dec 2015
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
402
20,528
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
521
27,295
0
01 Sep 2014
1