Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.01769
Cited By
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
5 December 2017
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhehuai Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"State-of-the-art Speech Recognition With Sequence-to-Sequence Models"
50 / 501 papers shown
Title
Dynamic curriculum learning via data parameters for noise robust keyword spotting
T. Higuchi
Shreyas Saxena
M. Souden
Tien Dung Tran
Masood Delfarah
C. Dhir
26
8
0
18 Feb 2021
Echo State Speech Recognition
H. Shrivastava
Ankush Garg
Yuan Cao
Yu Zhang
Tara N. Sainath
55
22
0
18 Feb 2021
End-to-End Automatic Speech Recognition with Deep Mutual Learning
Ryo Masumura
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Takanori Ashihara
27
5
0
16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
33
51
0
15 Feb 2021
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Milind Rao
Pranav Dheram
Gautam Tiwari
A. Raju
J. Droppo
Ariya Rastrow
A. Stolcke
24
17
0
12 Feb 2021
Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives
T. Narihira
Javier Alonsogarcia
Fabien Cardinaux
Akio Hayakawa
Masato Ishii
...
Kenji Suzuki
Stephen Tiedmann
Stefan Uhlich
T. Yashima
K. Yoshiyama
18
10
0
12 Feb 2021
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Zhong Meng
Naoyuki Kanda
Yashesh Gaur
S. Parthasarathy
Eric Sun
Liang Lu
Xie Chen
Jinyu Li
Jiawei Liu
AuLLM
49
52
0
02 Feb 2021
Transformer Based Deliberation for Two-Pass Speech Recognition
Ke Hu
Ruoming Pang
Tara N. Sainath
Trevor Strohman
27
37
0
27 Jan 2021
Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec
Jiatong Shi
Jiatong Shi. Jonathan D. Amith
Rey Castillo García
Esteban Guadalupe Sierra
Kevin Duh
Shinji Watanabe
33
46
0
26 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OT
SSL
40
112
0
19 Jan 2021
Electrocardiogram Classification and Visual Diagnosis of Atrial Fibrillation with DenseECG
Dacheng Chen
Dan Li
Xiuqin Xu
Ruizhi Yang
See-Kiong Ng
22
4
0
19 Jan 2021
Motion-Based Handwriting Recognition and Word Reconstruction
Junshen Kevin Chen
Wanze Xie
Yutong He
29
1
0
15 Jan 2021
Deep Learning Methods for Vessel Trajectory Prediction based on Recurrent Neural Networks
Samuele Capobianco
L. Millefiori
N. Forti
P. Braca
P. Willett
46
134
0
07 Jan 2021
Global Context Networks
Yue Cao
Jiarui Xu
Stephen Lin
Fangyun Wei
Han Hu
ISeg
36
96
0
24 Dec 2020
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
30
32
0
18 Dec 2020
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition
Minglun Han
Linhao Dong
Shiyu Zhou
Bo Xu
21
21
0
17 Dec 2020
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
Oliver Adams
Benjamin Galliot
Guillaume Wisniewski
Nicholas Lambourne
Ben Foley
...
Laurent Besacier
Christopher Cox
Katya Aplonova
Guillaume Jacques
Nathan W. Hill
40
10
0
15 Dec 2020
A review of on-device fully neural end-to-end automatic speech recognition algorithms
Chanwoo Kim
Dhananjaya N. Gowda
Dongsoo Lee
Jiyeon Kim
Ankur Kumar
Sungsoo Kim
Abhinav Garg
C. Han
27
27
0
14 Dec 2020
AV Taris: Online Audio-Visual Speech Recognition
George Sterpu
N. Harte
27
1
0
14 Dec 2020
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Rohit Prabhavalkar
Yanzhang He
David Rybach
S. Campbell
A. Narayanan
Trevor Strohman
Tara N. Sainath
52
35
0
12 Dec 2020
Frame-level SpecAugment for Deep Convolutional Neural Networks in Hybrid ASR Systems
Xinwei Li
Yuanyuan Zhang
Xiaodan Zhuang
Daben Liu
14
6
0
07 Dec 2020
Improving RNN Transducer With Target Speaker Extraction and Neural Uncertainty Estimation
Jiatong Shi
Chunlei Zhang
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
25
12
0
26 Nov 2020
Streaming end-to-end multi-talker speech recognition
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu
15
41
0
26 Nov 2020
A Better and Faster End-to-End Model for Streaming ASR
Yue Liu
Anmol Gulati
Jiahui Yu
Tara N. Sainath
Chung-Cheng Chiu
...
Wei Han
Qiao Liang
Yu Zhang
Trevor Strohman
Yonghui Wu
AuLLM
25
123
0
21 Nov 2020
Redesigning the classification layer by randomizing the class representation vectors
Gabi Shalev
Gal-Lev Shalev
Joseph Keshet
VLM
24
4
0
16 Nov 2020
Deep Shallow Fusion for RNN-T Personalization
Duc Le
Gil Keren
Julian Chan
Jay Mahadeokar
Christian Fuegen
M. Seltzer
26
77
0
16 Nov 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Xiaofei Wang
Naoyuki Kanda
Yashesh Gaur
Zhuo Chen
Zhong Meng
Takuya Yoshioka
17
13
0
05 Nov 2020
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
25
14
0
04 Nov 2020
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Sashi Novitasari
Andros Tjandra
S. Sakti
Satoshi Nakamura
CLL
14
12
0
04 Nov 2020
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time
Sashi Novitasari
Andros Tjandra
Tomoya Yanagita
S. Sakti
Satoshi Nakamura
CLL
6
1
0
04 Nov 2020
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Zhong Meng
S. Parthasarathy
Eric Sun
Yashesh Gaur
Naoyuki Kanda
Liang Lu
Xie Chen
Rui Zhao
Jinyu Li
Jiawei Liu
AuLLM
19
107
0
03 Nov 2020
Dynamic latency speech recognition with asynchronous revision
Mingkun Huang
Meng Cai
Jun Zhang
Yang Zhang
Yongbin You
Yi He
Zejun Ma
BDL
24
2
0
03 Nov 2020
Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition
Ching-Feng Yeh
Yongqiang Wang
Yangyang Shi
Chunyang Wu
Frank Zhang
Julian Chan
M. Seltzer
AI4TS
RALM
39
8
0
03 Nov 2020
Multitask Training with Text Data for End-to-End Speech Recognition
Peidong Wang
Tara N. Sainath
Ron J. Weiss
21
27
0
27 Oct 2020
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Suyoun Kim
Shangguan Yuan
Jay Mahadeokar
A. Bruguier
Christian Fuegen
M. Seltzer
Duc Le
23
28
0
26 Oct 2020
Real-Time Edge Classification: Optimal Offloading under Token Bucket Constraints
Ayan Chakrabarti
Roch Guérin
Chenyang Lu
Jiangnan Liu
15
15
0
26 Oct 2020
Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Yosuke Higuchi
Hirofumi Inaguma
Shinji Watanabe
Tetsuji Ogawa
Tetsunori Kobayashi
23
61
0
26 Oct 2020
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Qiujia Li
David Qiu
Yu Zhang
Yue Liu
Yanzhang He
P. Woodland
Liangliang Cao
Trevor Strohman
12
46
0
22 Oct 2020
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Xie Chen
Yu-Huan Wu
Zhenghao Wang
Shujie Liu
Jinyu Li
22
169
0
22 Oct 2020
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages
Anuj Diwan
Preethi Jyothi
16
5
0
19 Oct 2020
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
17
36
0
07 Oct 2020
Differentiable Weighted Finite-State Transducers
Awni Y. Hannun
Vineel Pratap
Jacob Kahn
Wei-Ning Hsu
36
29
0
02 Oct 2020
Sparse Communication for Training Deep Networks
Negar Foroutan
Martin Jaggi
FedML
30
16
0
19 Sep 2020
KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
Soohwan Kim
Seyoung Bae
Cheolhwang Won
VLM
22
5
0
07 Sep 2020
Silent Speech Interfaces for Speech Restoration: A Review
Jose Andres Gonzalez Lopez
Alejandro Gomez-Alanis
Juan M. Martín-Donas
J. L. Pérez-Córdoba
A. Gómez
37
85
0
04 Sep 2020
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Wei Li
James Qin
Chung-Cheng Chiu
Ruoming Pang
Yanzhang He
20
14
0
30 Aug 2020
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview
P. Bell
Joachim Fainberg
Ondˇrej Klejch
Jinyu Li
Steve Renals
P. Swietojanski
48
74
0
14 Aug 2020
Adaptable Multi-Domain Language Model for Transformer ASR
Taewoo Lee
Min-Joong Lee
Tae Gyoon Kang
Seokyeong Jung
Minseok Kwon
...
Ho-Gyeong Kim
Jiseung Jeong
Jihyun Lee
Hosik Lee
Y. S. Choi
24
17
0
14 Aug 2020
Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Milind Rao
A. Raju
Pranav Dheram
Bach Bui
Ariya Rastrow
21
43
0
14 Aug 2020
Online Automatic Speech Recognition with Listen, Attend and Spell Model
Roger Hsiao
Dogan Can
Tim Ng
R. Travadi
Arnab Ghoshal
RALM
4
17
0
12 Aug 2020
Previous
1
2
3
4
5
6
...
9
10
11
Next