Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.00716
Cited By
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
1 October 2019
Kyu Jeong Han
R. Prieto
Kaixing(Kai) Wu
T. Ma
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions"
20 / 20 papers shown
Title
A Transformer with Interleaved Self-attention and Convolution for Hybrid Acoustic Models
Liang Lu
79
4
0
23 Oct 2019
A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
74
721
0
13 Sep 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
234
8,444
0
19 Jun 2019
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation
Christoph Luscher
Eugen Beck
Kazuki Irie
M. Kitza
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
VLM
129
234
0
08 May 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
177
3,465
0
18 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
53
97
0
04 Apr 2019
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
53
118
0
22 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
253
3,745
0
09 Jan 2019
A novel pyramidal-FSMN architecture with lattice-free MMI for speech recognition
Xuerui Yang
Jiwei Li
Xi Zhou
66
15
0
26 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,114
0
11 Oct 2018
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese
Shiyu Zhou
Linhao Dong
Shuang Xu
Bo Xu
61
118
0
28 Apr 2018
The CAPIO 2017 Conversational Speech Recognition System
Kyu Jeong Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
116
72
0
29 Dec 2017
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
93
1,154
0
05 Dec 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
728
132,199
0
12 Jun 2017
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
416
10,526
0
21 Jul 2016
On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition
Rohit Prabhavalkar
O. Alsharif
A. Bruguier
Ian McGraw
54
103
0
25 Mar 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,322
0
10 Dec 2015
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
156
2,269
0
05 Aug 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
463
43,328
0
11 Feb 2015
Improving neural networks by preventing co-adaptation of feature detectors
Geoffrey E. Hinton
Nitish Srivastava
A. Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
VLM
457
7,666
0
03 Jul 2012
1