ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.11871
  4. Cited By
Towards Online End-to-end Transformer Automatic Speech Recognition

Towards Online End-to-end Transformer Automatic Speech Recognition

25 October 2019
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
ArXiv (abs)PDFHTML

Papers citing "Towards Online End-to-end Transformer Automatic Speech Recognition"

38 / 38 papers shown
Title
Streaming Speech-to-Confusion Network Speech Recognition
Streaming Speech-to-Confusion Network Speech Recognition
Denis Filimonov
Prabhat Pandey
Ariya Rastrow
Ankur Gandhe
A. Stolcke
HAI
57
0
0
02 Jun 2023
Enhancing the Unified Streaming and Non-streaming Model with Contrastive
  Learning
Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning
Yuting Yang
Yuke Li
Binbin Du
AI4TS
59
0
0
01 Jun 2023
Self-regularised Minimum Latency Training for Streaming
  Transformer-based Speech Recognition
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition
Mohan Li
R. Doddipatla
Catalin Zorila
133
0
0
24 Apr 2023
Transformer-based Streaming ASR with Cumulative Attention
Transformer-based Streaming ASR with Cumulative Attention
Mohan Li
Shucong Zhang
Catalin Zorila
R. Doddipatla
106
9
0
11 Mar 2022
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
A. Kalinov
Somshubra Majumdar
Jagadeesh Balam
Boris Ginsburg
MoE
40
3
0
22 Jul 2021
Alignment Knowledge Distillation for Online Streaming Attention-based
  Speech Recognition
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Hirofumi Inaguma
Tatsuya Kawahara
87
14
0
28 Feb 2021
Unidirectional Memory-Self-Attention Transducer for Online Speech
  Recognition
Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition
Jian Luo
Jianzong Wang
Ning Cheng
Jing Xiao
RALM
46
6
0
23 Feb 2021
Improving Streaming Automatic Speech Recognition With Non-Streaming
  Model Distillation On Unsupervised Data
Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Thibault Doutre
Wei Han
Min Ma
Zhiyun Lu
Chung-Cheng Chiu
Ruoming Pang
A. Narayanan
Ananya Misra
Yu Zhang
Liangliang Cao
92
23
0
22 Oct 2020
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context
  Modeling
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling
Jiahui Yu
Wei Han
Anmol Gulati
Chung-Cheng Chiu
Yue Liu
Tara N. Sainath
Yonghui Wu
Ruoming Pang
94
19
0
12 Oct 2020
Super-Human Performance in Online Low-latency Recognition of
  Conversational Speech
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
37
37
0
07 Oct 2020
Gated Recurrent Context: Softmax-free Attention for Online
  Encoder-Decoder Speech Recognition
Gated Recurrent Context: Softmax-free Attention for Online Encoder-Decoder Speech Recognition
Hyeonseung Lee
Woohyun Kang
Sung Jun Cheon
Hyeongju Kim
N. Kim
48
3
0
10 Jul 2020
Streaming Transformer ASR with Blockwise Synchronous Beam Search
Streaming Transformer ASR with Blockwise Synchronous Beam Search
E. Tsunoo
Yosuke Kashiwagi
Shinji Watanabe
98
11
0
25 Jun 2020
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech
  Recognition
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition
Shiliang Zhang
Zhifu Gao
Haoneng Luo
Ming Lei
Jie Ying Gao
Zhijie Yan
Lei Xie
47
29
0
21 May 2020
Enhancing Monotonic Multihead Attention for Streaming ASR
Enhancing Monotonic Multihead Attention for Streaming ASR
Hirofumi Inaguma
Masato Mimura
Tatsuya Kawahara
65
34
0
19 May 2020
High Performance Sequence-to-Sequence Model for Streaming Speech
  Recognition
High Performance Sequence-to-Sequence Model for Streaming Speech Recognition
T. Nguyen
Ngoc-Quan Pham
S. Stueker
A. Waibel
27
7
0
22 Mar 2020
Transformer Transducer: A Streamable Speech Recognition Model with
  Transformer Encoders and RNN-T Loss
Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Qian Zhang
Han Lu
Hasim Sak
Anshuman Tripathi
Erik McDermott
Stephen Koo
Shankar Kumar
88
481
0
07 Feb 2020
Transformer ASR with Contextual Block Processing
Transformer ASR with Contextual Block Processing
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
94
64
0
16 Oct 2019
A Comparative Study on Transformer vs RNN in Speech Applications
A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
74
721
0
13 Sep 2019
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using
  Self-Attention Network and Chunk-Hopping
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping
Linhao Dong
Feng Wang
Bo Xu
51
91
0
18 Feb 2019
Self-Attention Networks for Connectionist Temporal Classification in
  Speech Recognition
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
53
118
0
22 Jan 2019
An Online Attention-based Model for Speech Recognition
An Online Attention-based Model for Speech Recognition
Ruchao Fan
Pan Zhou
Wei Chen
Jia Jia
Gang Liu
45
48
0
13 Nov 2018
Improved training of end-to-end attention models for speech recognition
Improved training of end-to-end attention models for speech recognition
Albert Zeyer
Kazuki Irie
Ralf Schluter
Hermann Ney
VLM
73
270
0
08 May 2018
ESPnet: End-to-End Speech Processing Toolkit
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Sanjeev Khudanpur
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
109
1,509
0
30 Mar 2018
Self-Attentional Acoustic Models
Self-Attentional Acoustic Models
Matthias Sperber
Jan Niehues
Graham Neubig
Sebastian Stüker
A. Waibel
54
153
0
26 Mar 2018
Exploring Architectures, Data and Units For Streaming End-to-End Speech
  Recognition with RNN-Transducer
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer
Kanishka Rao
Hasim Sak
Rohit Prabhavalkar
AI4TS
81
348
0
02 Jan 2018
Monotonic Chunkwise Attention
Monotonic Chunkwise Attention
Chung-Cheng Chiu
Colin Raffel
67
256
0
14 Dec 2017
An analysis of incorporating an external language model into a
  sequence-to-sequence model
An analysis of incorporating an external language model into a sequence-to-sequence model
Anjuli Kannan
Yonghui Wu
Patrick Nguyen
Tara N. Sainath
Zhiwen Chen
Rohit Prabhavalkar
72
247
0
06 Dec 2017
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
93
1,154
0
05 Dec 2017
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech
  Recognition Baseline
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline
Hui Bu
Jiayu Du
Xingyu Na
Bengu Wu
Hao Zheng
CVBM
66
844
0
16 Sep 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
728
132,199
0
12 Jun 2017
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task
  Learning
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Suyoun Kim
Takaaki Hori
Shinji Watanabe
82
931
0
21 Sep 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
137
2,974
0
08 Dec 2015
A Neural Transducer
A Neural Transducer
Navdeep Jaitly
David Sussillo
Quoc V. Le
Oriol Vinyals
Ilya Sutskever
Samy Bengio
AI4TS
62
47
0
16 Nov 2015
Listen, Attend and Spell
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
156
2,269
0
05 Aug 2015
EESEN: End-to-End Speech Recognition using Deep RNN Models and
  WFST-based Decoding
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding
Yajie Miao
M. Gowayyed
Florian Metze
100
755
0
29 Jul 2015
Attention-Based Models for Speech Recognition
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
129
2,609
0
24 Jun 2015
Speech Recognition with Deep Recurrent Neural Networks
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
228
8,523
0
22 Mar 2013
Sequence Transduction with Recurrent Neural Networks
Sequence Transduction with Recurrent Neural Networks
Alex Graves
191
1,871
0
14 Nov 2012
1