1812.09336
Cited By
An Empirical Analysis of Deep Audio-Visual Models for Speech Recognition
21 December 2018
Devesh Walawalkar
Yihui He
R. Pillai
Papers citing
"An Empirical Analysis of Deep Audio-Visual Models for Speech Recognition"
24 / 24 papers shown
Title
Bounding Box Regression with Uncertainty for Accurate Object Detection
Yihui He
Chenchen Zhu
Jianren Wang
Marios Savvides
Xiangyu Zhang
ObjD
74
466
0
23 Sep 2018
Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition
George Sterpu
Christian Saam
N. Harte
62
65
0
05 Sep 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
109
2,995
0
31 Jul 2018
Large-Scale Visual Speech Recognition
Brendan Shillingford
Yannis Assael
Matthew W. Hoffman
T. Paine
Cían Hughes
...
Marie Mulville
Ben Coppin
Ben Laurie
A. Senior
Nando de Freitas
53
152
0
13 Jul 2018
Deep Lip Reading: a comparison of models and an online application
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
53
118
0
15 Jun 2018
The Conversation: Deep Audio-Visual Speech Enhancement
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
72
360
0
11 Apr 2018
End-to-end Audiovisual Speech Recognition
Stavros Petridis
Themos Stafylakis
Pingchuan Ma
Feipeng Cai
Georgios Tzimiropoulos
Maja Pantic
64
250
0
18 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li-Jia Li
Song Han
80
1,348
0
10 Feb 2018
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
194
2,519
0
19 Jul 2017
3D Convolutional Neural Networks for Cross Audio-Visual Matching Recognition
A. Torfi
Seyed Mehdi Iranmanesh
Nasser M. Nasrabadi
J. Dawson
23
103
0
18 Jun 2017
Combining Residual Networks with LSTMs for Lipreading
Themos Stafylakis
Georgios Tzimiropoulos
VLM
52
307
0
12 Mar 2017
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
245
788
0
16 Nov 2016
LipNet: End-to-End Sentence-level Lipreading
Yannis Assael
Brendan Shillingford
Shimon Whiteson
Nando de Freitas
61
395
0
05 Nov 2016
Xception: Deep Learning with Depthwise Separable Convolutions
François Chollet
MDE
BDL
PINN
984
14,493
0
07 Oct 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
190
5,706
0
23 Feb 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
1.6K
192,638
0
10 Dec 2015
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
147
2,261
0
05 Aug 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
430
61,900
0
04 Jun 2015
Fast R-CNN
Ross B. Girshick
ObjD
284
24,976
0
30 Apr 2015
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
366
43,511
0
17 Sep 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.1K
99,991
0
04 Sep 2014
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
340
43,290
0
01 May 2014
Visualizing and Understanding Convolutional Networks
Matthew D. Zeiler
Rob Fergus
FAtt
SSL
400
15,825
0
12 Nov 2013
Speech Recognition by Machine, A Review
M. Anusuya
S. Katti
70
394
0
13 Jan 2010