arXiv:1712.01769
State-of-the-art Speech Recognition With Sequence-to-Sequence Models

5 December 2017
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhifeng Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Katya Gonina
Navdeep Jaitly
Bo Li
J. Chorowski
M. Bacchiani
    AI4TS

Papers citing "State-of-the-art Speech Recognition With Sequence-to-Sequence Models"

50 / 501 papers shown
Title
Dynamic curriculum learning via data parameters for noise robust keyword spotting
T. Higuchi
Shreyas Saxena
M. Souden
Tien Dung Tran
Masood Delfarah
C. Dhir
26
8
0
18 Feb 2021
Echo State Speech Recognition
H. Shrivastava
Ankush Garg
Yuan Cao
Yu Zhang
Tara N. Sainath
55
22
0
18 Feb 2021
End-to-End Automatic Speech Recognition with Deep Mutual Learning
Ryo Masumura
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Takanori Ashihara
27
5
0
16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
33
51
0
15 Feb 2021
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Milind Rao
Pranav Dheram
Gautam Tiwari
A. Raju
J. Droppo
Ariya Rastrow
A. Stolcke
24
17
0
12 Feb 2021
Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives
T. Narihira
Javier Alonsogarcia
Fabien Cardinaux
Akio Hayakawa
Masato Ishii
...
Kenji Suzuki
Stephen Tiedmann
Stefan Uhlich
T. Yashima
K. Yoshiyama
18
10
0
12 Feb 2021
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Zhong Meng
Naoyuki Kanda
Yashesh Gaur
S. Parthasarathy
Eric Sun
Liang Lu
Xie Chen
Jinyu Li
Yifan Gong
AuLLM
49
52
0
02 Feb 2021
Transformer Based Deliberation for Two-Pass Speech Recognition
Ke Hu
Ruoming Pang
Tara N. Sainath
Trevor Strohman
27
37
0
27 Jan 2021
Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec
Jiatong Shi
Jonathan D. Amith
Rey Castillo García
Esteban Guadalupe Sierra
Kevin Duh
Shinji Watanabe
33
46
0
26 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Chengyi Wang
Yu Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OT
SSL
40
112
0
19 Jan 2021
Electrocardiogram Classification and Visual Diagnosis of Atrial Fibrillation with DenseECG
Dacheng Chen
Dan Li
Xiuqin Xu
Ruizhi Yang
See-Kiong Ng
22
4
0
19 Jan 2021
Motion-Based Handwriting Recognition and Word Reconstruction
Junshen Kevin Chen
Wanze Xie
Yutong He
29
1
0
15 Jan 2021
Deep Learning Methods for Vessel Trajectory Prediction based on Recurrent Neural Networks
Samuele Capobianco
L. Millefiori
N. Forti
P. Braca
P. Willett
46
134
0
07 Jan 2021
Global Context Networks
Yue Cao
Jiarui Xu
Stephen Lin
Fangyun Wei
Han Hu
ISeg
36
96
0
24 Dec 2020
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
30
32
0
18 Dec 2020
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition
Minglun Han
Linhao Dong
Shiyu Zhou
Bo Xu
21
21
0
17 Dec 2020
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
Oliver Adams
Benjamin Galliot
Guillaume Wisniewski
Nicholas Lambourne
Ben Foley
...
Laurent Besacier
Christopher Cox
Katya Aplonova
Guillaume Jacques
Nathan W. Hill
40
10
0
15 Dec 2020
A review of on-device fully neural end-to-end automatic speech recognition algorithms
Chanwoo Kim
Dhananjaya N. Gowda
Dongsoo Lee
Jiyeon Kim
Ankur Kumar
Sungsoo Kim
Abhinav Garg
C. Han
27
27
0
14 Dec 2020
AV Taris: Online Audio-Visual Speech Recognition
George Sterpu
N. Harte
27
1
0
14 Dec 2020
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Rohit Prabhavalkar
Yanzhang He
David Rybach
S. Campbell
A. Narayanan
Trevor Strohman
Tara N. Sainath
52
35
0
12 Dec 2020
Frame-level SpecAugment for Deep Convolutional Neural Networks in Hybrid ASR Systems
Xinwei Li
Yuanyuan Zhang
Xiaodan Zhuang
Daben Liu
14
6
0
07 Dec 2020
Improving RNN Transducer With Target Speaker Extraction and Neural Uncertainty Estimation
Jiatong Shi
Chunlei Zhang
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
25
12
0
26 Nov 2020
Streaming end-to-end multi-talker speech recognition
Liang Lu
Naoyuki Kanda
Jinyu Li
Yifan Gong
15
41
0
26 Nov 2020
A Better and Faster End-to-End Model for Streaming ASR
Bo Li
Anmol Gulati
Jiahui Yu
Tara N. Sainath
Chung-Cheng Chiu
...
Wei Han
Qiao Liang
Yu Zhang
Trevor Strohman
Yonghui Wu
AuLLM
25
123
0
21 Nov 2020
Redesigning the classification layer by randomizing the class representation vectors
Gabi Shalev
Gal-Lev Shalev
Joseph Keshet
VLM
24
4
0
16 Nov 2020
Deep Shallow Fusion for RNN-T Personalization
Duc Le
Gil Keren
Julian Chan
Jay Mahadeokar
Christian Fuegen
M. Seltzer
26
77
0
16 Nov 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Xiaofei Wang
Naoyuki Kanda
Yashesh Gaur
Zhuo Chen
Zhong Meng
Takuya Yoshioka
17
13
0
05 Nov 2020
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
25
14
0
04 Nov 2020
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Sashi Novitasari
Andros Tjandra
S. Sakti
Satoshi Nakamura
CLL
14
12
0
04 Nov 2020
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time
Sashi Novitasari
Andros Tjandra
Tomoya Yanagita
S. Sakti
Satoshi Nakamura
CLL
6
1
0
04 Nov 2020
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Zhong Meng
S. Parthasarathy
Eric Sun
Yashesh Gaur
Naoyuki Kanda
Liang Lu
Xie Chen
Rui Zhao
Jinyu Li
Yifan Gong
AuLLM
19
107
0
03 Nov 2020
Dynamic latency speech recognition with asynchronous revision
Mingkun Huang
Meng Cai
Jun Zhang
Yang Zhang
Yongbin You
Yi He
Zejun Ma
BDL
24
2
0
03 Nov 2020
Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition
Ching-Feng Yeh
Yongqiang Wang
Yangyang Shi
Chunyang Wu
Frank Zhang
Julian Chan
M. Seltzer
AI4TS
RALM
39
8
0
03 Nov 2020
Multitask Training with Text Data for End-to-End Speech Recognition
Peidong Wang
Tara N. Sainath
Ron J. Weiss
21
27
0
27 Oct 2020
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Suyoun Kim
Shangguan Yuan
Jay Mahadeokar
A. Bruguier
Christian Fuegen
M. Seltzer
Duc Le
23
28
0
26 Oct 2020
Real-Time Edge Classification: Optimal Offloading under Token Bucket Constraints
Ayan Chakrabarti
Roch Guérin
Chenyang Lu
Jiangnan Liu
15
15
0
26 Oct 2020
Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Yosuke Higuchi
Hirofumi Inaguma
Shinji Watanabe
Tetsuji Ogawa
Tetsunori Kobayashi
23
61
0
26 Oct 2020
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Qiujia Li
David Qiu
Yu Zhang
Bo Li
Yanzhang He
P. Woodland
Liangliang Cao
Trevor Strohman
12
46
0
22 Oct 2020
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Xie Chen
Yu Wu
Zhenghao Wang
Shujie Liu
Jinyu Li
22
169
0
22 Oct 2020
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages
Anuj Diwan
Preethi Jyothi
16
5
0
19 Oct 2020
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
17
36
0
07 Oct 2020
Differentiable Weighted Finite-State Transducers
Awni Y. Hannun
Vineel Pratap
Jacob Kahn
Wei-Ning Hsu
36
29
0
02 Oct 2020
Sparse Communication for Training Deep Networks
Negar Foroutan
Martin Jaggi
FedML
30
16
0
19 Sep 2020
KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
Soohwan Kim
Seyoung Bae
Cheolhwang Won
VLM
22
5
0
07 Sep 2020
Silent Speech Interfaces for Speech Restoration: A Review
Jose Andres Gonzalez Lopez
Alejandro Gomez-Alanis
Juan M. Martín-Donas
J. L. Pérez-Córdoba
A. Gómez
37
85
0
04 Sep 2020
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Wei Li
James Qin
Chung-Cheng Chiu
Ruoming Pang
Yanzhang He
20
14
0
30 Aug 2020
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview
P. Bell
Joachim Fainberg
Ondřej Klejch
Jinyu Li
Steve Renals
P. Swietojanski
48
74
0
14 Aug 2020
Adaptable Multi-Domain Language Model for Transformer ASR
Taewoo Lee
Min-Joong Lee
Tae Gyoon Kang
Seokyeong Jung
Minseok Kwon
...
Ho-Gyeong Kim
Jiseung Jeong
Jihyun Lee
Hosik Lee
Y. S. Choi
24
17
0
14 Aug 2020
Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Milind Rao
A. Raju
Pranav Dheram
Bach Bui
Ariya Rastrow
21
43
0
14 Aug 2020
Online Automatic Speech Recognition with Listen, Attend and Spell Model
Roger Hsiao
Dogan Can
Tim Ng
R. Travadi
Arnab Ghoshal
RALM
4
17
0
12 Aug 2020