Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2001.09727
Cited By
Scaling Up Online Speech Recognition Using ConvNets
27 January 2020
Vineel Pratap
Qiantong Xu
Jacob Kahn
Gilad Avidov
Tatiana Likhomanenko
Awni Y. Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scaling Up Online Speech Recognition Using ConvNets"
24 / 24 papers shown
Title
emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface Electromyography
Viswanath Sivakumar
Jeffrey Seely
Alan Du
Sean R Bittner
Adam Berenzweig
Anuoluwapo Bolarinwa
Alexandre Gramfort
Michael I Mandel
31
4
0
26 Oct 2024
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures
Gabriel Synnaeve
Qiantong Xu
Jacob Kahn
Tatiana Likhomanenko
Edouard Grave
Vineel Pratap
Anuroop Sriram
Vitaliy Liptchinsky
R. Collobert
SSL
AI4TS
83
245
0
19 Nov 2019
RNN-T For Latency Controlled ASR With Improved Beam Search
Mahaveer Jain
Kjell Schubert
Jay Mahadeokar
Ching-Feng Yeh
Kaustubh Kalgaonkar
Anuroop Sriram
Christian Fuegen
M. Seltzer
32
44
0
05 Nov 2019
Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Yongqiang Wang
Abdel-rahman Mohamed
Duc Le
Chunxi Liu
Alex Xiao
...
Xiaohui Zhang
Frank Zhang
Christian Fuegen
Geoffrey Zweig
M. Seltzer
29
248
0
22 Oct 2019
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition
Duc Le
Xiaohui Zhang
Weiyi Zheng
C. Fügen
Geoffrey Zweig
M. Seltzer
28
63
0
02 Oct 2019
Reducing Transformer Depth on Demand with Structured Dropout
Angela Fan
Edouard Grave
Armand Joulin
76
586
0
25 Sep 2019
A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
44
718
0
13 Sep 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
128
3,435
0
18 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
35
95
0
04 Apr 2019
wav2letter++: The Fastest Open-source Speech Recognition System
Vineel Pratap
Awni Y. Hannun
Qiantong Xu
Jeff Cai
Jacob Kahn
Gabriel Synnaeve
Vitaliy Liptchinsky
R. Collobert
VLM
33
156
0
18 Dec 2018
Fully Convolutional Speech Recognition
Neil Zeghidour
Qiantong Xu
Vitaliy Liptchinsky
Nicolas Usunier
Gabriel Synnaeve
R. Collobert
28
91
0
17 Dec 2018
Streaming End-to-end Speech Recognition For Mobile Devices
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
R. Álvarez
...
K. Sim
Tom Bagby
Shuo-yiin Chang
Kanishka Rao
A. Gruenstein
61
624
0
15 Nov 2018
Improved training of end-to-end attention models for speech recognition
Albert Zeyer
Kazuki Irie
Ralf Schluter
Hermann Ney
VLM
36
269
0
08 May 2018
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
83
1,153
0
29 Apr 2018
Efficient Neural Audio Synthesis
Nal Kalchbrenner
Erich Elsen
Karen Simonyan
Seb Noury
Norman Casagrande
Edward Lockhart
Florian Stimberg
Aaron van den Oord
Sander Dieleman
Koray Kavukcuoglu
59
866
0
23 Feb 2018
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
65
1,150
0
05 Dec 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
223
129,831
0
12 Jun 2017
Depthwise Separable Convolutions for Neural Machine Translation
Lukasz Kaiser
Aidan Gomez
François Chollet
48
279
0
09 Jun 2017
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
985
20,692
0
17 Apr 2017
Xception: Deep Learning with Depthwise Separable Convolutions
François Chollet
MDE
BDL
PINN
550
14,454
0
07 Oct 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
160
10,412
0
21 Jul 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
80
2,965
0
08 Dec 2015
Highway Long Short-Term Memory RNNs for Distant Speech Recognition
Yu Zhang
Guoguo Chen
Dong Yu
Kaisheng Yao
Sanjeev Khudanpur
James R. Glass
3DV
AI4TS
46
291
0
30 Oct 2015
Sequence Transduction with Recurrent Neural Networks
Alex Graves
66
1,858
0
14 Nov 2012
1