Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.04275
Cited By
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition
8 June 2021
Max W. Y. Lam
Jun Wang
Chao Weng
Dan Su
Dong Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition"
25 / 25 papers shown
Title
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
53
6
0
02 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
AI4TS
97
49
0
01 Mar 2021
Echo State Speech Recognition
H. Shrivastava
Ankush Garg
Yuan Cao
Yu Zhang
Tara N. Sainath
99
22
0
18 Feb 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
84
29
0
13 Jan 2021
Recent Developments on ESPnet Toolkit Boosted by Conformer
Pengcheng Guo
Florian Boyer
Xuankai Chang
Tomoki Hayashi
Yosuke Higuchi
...
Jing Shi
Shinji Watanabe
Kun Wei
Wangyou Zhang
Yuekai Zhang
75
263
0
26 Oct 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
224
3,153
0
16 May 2020
Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition
Chao Weng
Chengzhu Yu
Jia Cui
Chunlei Zhang
Dong Yu
125
39
0
28 Nov 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
86
771
0
14 Oct 2019
Improving RNN Transducer Modeling for End-to-End Speech Recognition
Jinyu Li
Rui Zhao
Hu Hu
Jiawei Liu
57
170
0
26 Sep 2019
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
77
148
0
29 Aug 2019
Speech and Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
34
30
0
13 Dec 2018
Exploring RNN-Transducer for Chinese Speech Recognition
Senmao Wang
Pan Zhou
Wei Chen
Jia Jia
Lei Xie
71
31
0
13 Nov 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
159
1,794
0
20 Sep 2018
AISHELL-2: Transforming Mandarin ASR Research Into Industrial Scale
Jiayu Du
Xingyu Na
Xuechen Liu
Hui Bu
VLM
54
287
0
31 Aug 2018
End-to-End Speech Recognition From the Raw Waveform
Neil Zeghidour
Nicolas Usunier
Gabriel Synnaeve
R. Collobert
Emmanuel Dupoux
95
84
0
19 Jun 2018
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer
Kanishka Rao
Hasim Sak
Rohit Prabhavalkar
AI4TS
81
348
0
02 Jan 2018
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
91
1,154
0
05 Dec 2017
Learning Filterbanks from Raw Speech for Phone Recognition
Neil Zeghidour
Nicolas Usunier
Iasonas Kokkinos
Thomas Schatz
Gabriel Synnaeve
Emmanuel Dupoux
66
120
0
03 Nov 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
722
132,199
0
12 Jun 2017
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition
H. Soltau
H. Liao
Hasim Sak
74
310
0
31 Oct 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
413
10,526
0
21 Jul 2016
Learning Multiscale Features Directly From Waveforms
Zhenyao Zhu
Jesse Engel
Awni Y. Hannun
76
65
0
31 Mar 2016
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
156
2,269
0
05 Aug 2015
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
593
12,734
0
11 Dec 2014
Sequence Transduction with Recurrent Neural Networks
Alex Graves
191
1,871
0
14 Nov 2012
1