Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.01769
Cited By
v1
v2
v3
v4
v5
v6 (latest)
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
5 December 2017
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhiwen Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"State-of-the-art Speech Recognition With Sequence-to-Sequence Models"
50 / 501 papers shown
Title
Towards Online End-to-end Transformer Automatic Speech Recognition
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
84
32
0
25 Oct 2019
Recognizing long-form speech using streaming end-to-end models
A. Narayanan
Rohit Prabhavalkar
Chung-Cheng Chiu
David Rybach
Tara N. Sainath
Trevor Strohman
76
130
0
24 Oct 2019
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR
Duc Le
T. Koehler
Christian Fuegen
M. Seltzer
73
16
0
22 Oct 2019
Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Yongqiang Wang
Abdel-rahman Mohamed
Duc Le
Chunxi Liu
Alex Xiao
...
Xiaohui Zhang
Frank Zhang
Christian Fuegen
Geoffrey Zweig
M. Seltzer
68
249
0
22 Oct 2019
Transformer ASR with Contextual Block Processing
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
113
64
0
16 Oct 2019
Orthogonality Constrained Multi-Head Attention For Keyword Spotting
Mingu Lee
Jinkyu Lee
Hye Jin Jang
Byeonggeun Kim
Wonil Chang
Kyuwoong Hwang
47
11
0
10 Oct 2019
Federated Learning of N-gram Language Models
Mingqing Chen
A. Suresh
Rajiv Mathews
Adeline Wong
Cyril Allauzen
F. Beaufays
Michael Riley
FedML
117
75
0
08 Oct 2019
One-To-Many Multilingual End-to-end Speech Translation
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
89
51
0
08 Oct 2019
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition
Duc Le
Xiaohui Zhang
Weiyi Zheng
C. Fügen
Geoffrey Zweig
M. Seltzer
92
64
0
02 Oct 2019
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
Kyu Jeong Han
R. Prieto
Kaixing(Kai) Wu
T. Ma
134
70
0
01 Oct 2019
AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference
Thierry Tambe
En-Yu Yang
Zishen Wan
Yuntian Deng
Vijay Janapa Reddi
Alexander M. Rush
David Brooks
Gu-Yeon Wei
MQ
58
21
0
29 Sep 2019
Improving RNN Transducer Modeling for End-to-End Speech Recognition
Jinyu Li
Rui Zhao
Hu Hu
Jiawei Liu
76
170
0
26 Sep 2019
Optimizing Speech Recognition For The Edge
Yuan Shangguan
Jian Li
Qiao Liang
R. Álvarez
Ian McGraw
78
64
0
26 Sep 2019
Speech Recognition with Augmented Synthesized Speech
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Ye Jia
Pedro J. Moreno
Yonghui Wu
Zelin Wu
67
128
0
25 Sep 2019
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
Hirofumi Inaguma
Masato Mimura
S. Sakai
Tatsuya Kawahara
40
5
0
22 Sep 2019
Alleviating Sequence Information Loss with Data Overlapping and Prime Batch Sizes
Noémien Kocher
Christian Scuito
Lorenzo Tarantino
Alexandros Lazaridis
Andreas Fischer
C. Musat
37
0
0
18 Sep 2019
An Investigation Into On-device Personalization of End-to-end Automatic Speech Recognition Models
K. Sim
P. Zadrazil
F. Beaufays
88
58
0
14 Sep 2019
Integrating Source-channel and Attention-based Sequence-to-sequence Models for Speech Recognition
Qiujia Li
Chao Zhang
P. Woodland
63
20
0
14 Sep 2019
Speculative Beam Search for Simultaneous Translation
Renjie Zheng
Mingbo Ma
Baigong Zheng
Liang Huang
98
24
0
12 Sep 2019
Learning Dynamic Author Representations with Temporal Language Models
E. Delasalles
Sylvain Lamprier
Ludovic Denoyer
57
9
0
11 Sep 2019
Self-Teaching Networks
Liang Lu
Eric Sun
Jiawei Liu
SSL
66
4
0
09 Sep 2019
CMU GetGoing: An Understandable and Memorable Dialog System for Seniors
Shikib Mehri
A. Black
M. Eskénazi
41
3
0
03 Sep 2019
Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning
Linchao Zhu
Sercan O. Arik
Yezhou Yang
Tomas Pfister
76
5
0
29 Aug 2019
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
102
148
0
29 Aug 2019
Environment Sound Classification using Multiple Feature Channels and Attention based Deep Convolutional Neural Network
Jivitesh Sharma
Ole-Christoffer Granmo
M. G. Olsen
79
16
0
28 Aug 2019
Gender Representation in French Broadcast Corpora and Its Impact on ASR Performance
Mahault Garnerin
Solange Rossato
Laurent Besacier
53
52
0
23 Aug 2019
TabNet: Attentive Interpretable Tabular Learning
Sercan O. Arik
Tomas Pfister
LMTD
226
1,381
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
89
25
0
19 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DV
VLM
AI4TS
100
212
0
16 Aug 2019
Challenging the Boundaries of Speech Recognition: The MALACH Corpus
M. Picheny
Zoltán Tüske
Brian Kingsbury
Kartik Audhkhasi
Xiaodong Cui
G. Saon
AuLLM
34
13
0
09 Aug 2019
Understanding Optical Music Recognition
Jorge Calvo-Zaragoza
Jan Hajic
Alexander Pacha
75
118
0
07 Aug 2019
Classification of Hand Movements from EEG using a Deep Attention-based LSTM Network
Guangyi Zhang
Vandad Davoodnia
Alireza Sepas-Moghaddam
Yaoxue Zhang
Ali Etemad
83
130
0
06 Aug 2019
SF-Net: Structured Feature Network for Continuous Sign Language Recognition
Zhaoyang Yang
Zhenmei Shi
Xiaoyong Shen
Yu-Wing Tai
SLR
58
64
0
04 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
122
66
0
02 Aug 2019
Personalizing ASR for Dysarthric and Accented Speech with Limited Data
Joel Shor
Dotan Emanuel
Oran Lang
Omry Tuval
Michael P. Brenner
...
Maeve McNally
Taylor Charbonneau
Melissa Nollstadt
Avinatan Hassidim
Yossi Matias
57
105
0
31 Jul 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
KELM
76
38
0
13 Jul 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Wei-Ning Hsu
David Harwath
James R. Glass
SSL
65
32
0
09 Jul 2019
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR
F. Weninger
Jesús Andrés-Ferrer
Xinwei Li
P. Zhan
AI4TS
77
26
0
08 Jul 2019
FortuneTeller: Predicting Microarchitectural Attacks via Unsupervised Deep Learning
Berk Gülmezoglu
A. Moghimi
T. Eisenbarth
B. Sunar
AAML
66
38
0
08 Jul 2019
NIESR: Nuisance Invariant End-to-end Speech Recognition
I-Hung Hsu
Ayush Jaiswal
Premkumar Natarajan
49
6
0
07 Jul 2019
Improving Performance of End-to-End ASR on Numeric Sequences
Cal Peyser
Hao Zhang
Tara N. Sainath
Zelin Wu
AI4TS
63
36
0
01 Jul 2019
Self Multi-Head Attention for Speaker Recognition
Miquel India
Pooyan Safari
Javier Hernando
76
111
0
24 Jun 2019
End-to-End ASR for Code-switched Hindi-English Speech
B. M. L. Srivastava
Basil Abraham
Sunayana Sitaram
Rupeshkumar Mehta
Preethi Jyothi
28
2
0
22 Jun 2019
Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Ke Hu
A. Bruguier
Tara N. Sainath
Rohit Prabhavalkar
Golan Pundak
38
19
0
21 Jun 2019
Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias
Ryo Nakashima
Ryo Ozaki
T. Taniguchi
66
6
0
21 Jun 2019
Multi-Stream End-to-End Speech Recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Shinji Watanabe
Takaaki Hori
H. Hermansky
55
21
0
17 Jun 2019
On Single Source Robustness in Deep Fusion Models
Taewan Kim
Joydeep Ghosh
AAML
52
22
0
11 Jun 2019
Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model
Aishwarya Bhandare
Vamsi Sripathi
Deepthi Karkada
Vivek V. Menon
Sun Choi
Kushal Datta
V. Saletore
MQ
88
132
0
03 Jun 2019
Multivariate, Multistep Forecasting, Reconstruction and Feature Selection of Ocean Waves via Recurrent and Sequence-to-Sequence Networks
Mohammad Pirhooshyaran
L. Snyder
AI4TS
36
7
0
01 Jun 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
83
128
0
27 May 2019
Previous
1
2
3
...
10
11
7
8
9
Next