Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.05358
Cited By
v1
v2 (latest)
Lip Reading Sentences in the Wild
16 November 2016
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Lip Reading Sentences in the Wild"
44 / 344 papers shown
Title
Zero-shot keyword spotting for visual speech recognition in-the-wild
Themos Stafylakis
Georgios Tzimiropoulos
68
38
0
23 Jul 2018
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
CVBM
96
443
0
20 Jul 2018
Large-Scale Visual Speech Recognition
Brendan Shillingford
Yannis Assael
Matthew W. Hoffman
T. Paine
Cían Hughes
...
Marie Mulville
Ben Coppin
Ben Laurie
A. Senior
Nando de Freitas
95
155
0
13 Jul 2018
Deep Lip Reading: a comparison of models and an online application
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
75
119
0
15 Jun 2018
A Framework for Speechreading Acquisition Tools
Benjamin M. Gorman
51
10
0
03 Jun 2018
Can DNNs Learn to Lipread Full Sentences?
George Sterpu
Christian Saam
N. Harte
51
8
0
29 May 2018
On Learning Associations of Faces and Voices
Changil Kim
Hijung Valentina Shin
Tae-Hyun Oh
Alexandre Kaspar
Mohamed A. Elgharib
Wojciech Matusik
CVBM
93
84
0
15 May 2018
Remote Detection of Idling Cars Using Infrared Imaging and Deep Networks
M. Bastan
Kim-Hui Yap
Lap-Pui Chau
59
6
0
28 Apr 2018
Automatic speech recognition for launch control center communication using recurrent neural networks with data augmentation and custom language model
Kyongsik Yun
Joseph Osborne
Madison Lee
Thomas Lu
Edward Chow
57
5
0
24 Apr 2018
The Conversation: Deep Audio-Visual Speech Enhancement
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
93
360
0
11 Apr 2018
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Andrew Owens
Alexei A. Efros
SSL
154
754
0
10 Apr 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
CVBM
111
221
0
01 Apr 2018
Lip Movements Generation at a Glance
Lele Chen
Zhiheng Li
R. Maddox
Z. Duan
Chenliang Xu
144
265
0
28 Mar 2018
Audio-Visual Event Localization in Unconstrained Videos
Yapeng Tian
Jing Shi
Bochen Li
Zhiyao Duan
Chenliang Xu
127
442
0
23 Mar 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang
68
97
0
13 Mar 2018
Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion
Matthijs Van Keirsbilck
Bert Moons
Marian Verhelst
HAI
26
3
0
13 Mar 2018
Trustless Machine Learning Contracts; Evaluating and Exchanging Machine Learning Models on the Ethereum Blockchain
A. Krizhevsky
Geoffrey E. Hinton
SyDa
81
109
0
27 Feb 2018
End-to-end Audiovisual Speech Recognition
Stavros Petridis
Themos Stafylakis
Pingchuan Ma
Feipeng Cai
Georgios Tzimiropoulos
Maja Pantic
100
253
0
18 Feb 2018
Visual-Only Recognition of Normal, Whispered and Silent Speech
Stavros Petridis
Jie Shen
Doruk Cetin
Maja Pantic
47
55
0
18 Feb 2018
Deep Learning for Lip Reading using Audio-Visual Information for Urdu Language
M. Faisal
S. Manzoor
35
14
0
15 Feb 2018
Deep learning in radiology: an overview of the concepts and a survey of the state of the art
Maciej A. Mazurowski
Mateusz Buda
Ashirbani Saha
Mustafa R. Bashir
MedIm
AI4CE
62
443
0
10 Feb 2018
Dual Memory Neural Computer for Asynchronous Two-view Sequential Learning
Hung Le
T. Tran
Svetha Venkatesh
80
105
0
02 Feb 2018
Multi-Task Spatiotemporal Neural Networks for Structured Surface Reconstruction
Mingze Xu
Chenyou Fan
John Paden
Geoffrey C. Fox
David J. Crandall
32
14
0
11 Jan 2018
Audio to Body Dynamics
Eli Shlizerman
Lucio Dery
Hayden Schoen
Ira Kemelmacher-Shlizerman
VGen
116
154
0
19 Dec 2017
Visual Speech Enhancement
Aviv Gabbay
Asaph Shamir
Shmuel Peleg
54
16
0
23 Nov 2017
Deep word embeddings for visual speech recognition
Themos Stafylakis
Georgios Tzimiropoulos
63
19
0
30 Oct 2017
Combining Multiple Views for Visual Speech Recognition
Marina Zimmermann
Mostafa Mehdipour-Ghazi
H. K. Ekenel
Jean-Philippe Thiran
29
6
0
19 Oct 2017
Visual speech recognition: aligning terminologies for better understanding
Helen L. Bear
Sarah L. Taylor
79
8
0
03 Oct 2017
End-to-End Audiovisual Fusion with LSTMs
Stavros Petridis
Yujiang Wang
Zuwei Li
Maja Pantic
VLM
58
38
0
12 Sep 2017
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
512
26,778
0
05 Sep 2017
End-to-End Multi-View Lipreading
Stavros Petridis
Yujiang Wang
Zuwei Li
Maja Pantic
58
49
0
01 Sep 2017
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Jen-Cheng Hou
Syu-Siang Wang
Ying-Hui Lai
Yu Tsao
Hsiu-Wen Chang
H. Wang
151
198
0
01 Sep 2017
Seeing Through Noise: Visually Driven Speaker Separation and Enhancement
Aviv Gabbay
Ariel Ephrat
Tavi Halperin
Shmuel Peleg
97
19
0
22 Aug 2017
Improving Speaker-Independent Lipreading with Domain-Adversarial Training
Michael Wand
Jürgen Schmidhuber
70
42
0
04 Aug 2017
Improved Speech Reconstruction from Silent Video
Ariel Ephrat
Tavi Halperin
Shmuel Peleg
117
89
0
01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNN
MQ
129
35
0
31 Jul 2017
Vision-based Detection of Acoustic Timed Events: a Case Study on Clarinet Note Onsets
A. Bazzica
Jan van Gemert
Cynthia C. S. Liem
A. Hanjalic
47
12
0
29 Jun 2017
You said that?
Joon Son Chung
A. Jamaludin
Andrew Zisserman
CVBM
77
260
0
08 May 2017
Towards Estimating the Upper Bound of Visual-Speech Recognition: The Visual Lip-Reading Feasibility Database
Adriana Fernandez-Lopez
Oriol Martínez
Federico Sukno
68
39
0
26 Apr 2017
Learning weakly supervised multimodal phoneme embeddings
Rahma Chaabouni
Ewan Dunbar
Neil Zeghidour
Emmanuel Dupoux
SSL
51
10
0
23 Apr 2017
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Jen-Cheng Hou
Syu-Siang Wang
Ying-Hui Lai
Yu Tsao
Hsiu-Wen Chang
H. Wang
113
23
0
30 Mar 2017
Combining Residual Networks with LSTMs for Lipreading
Themos Stafylakis
Georgios Tzimiropoulos
VLM
130
310
0
12 Mar 2017
Vid2speech: Speech Reconstruction from Silent Video
Ariel Ephrat
Shmuel Peleg
110
123
0
02 Jan 2017
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
109
370
0
08 Dec 2016
Previous
1
2
3
4
5
6
7