ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.14318
  4. Cited By
Multitask Training with Text Data for End-to-End Speech Recognition

Multitask Training with Text Data for End-to-End Speech Recognition

27 October 2020
Peidong Wang
Tara N. Sainath
Ron J. Weiss
ArXivPDFHTML

Papers citing "Multitask Training with Text Data for End-to-End Speech Recognition"

28 / 28 papers shown
Title
Less Is More: Improved RNN-T Decoding Using Limited Label Context and
  Path Merging
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Rohit Prabhavalkar
Yanzhang He
David Rybach
S. Campbell
A. Narayanan
Trevor Strohman
Tara N. Sainath
89
35
0
12 Dec 2020
A Better and Faster End-to-End Model for Streaming ASR
A Better and Faster End-to-End Model for Streaming ASR
Yue Liu
Anmol Gulati
Jiahui Yu
Tara N. Sainath
Chung-Cheng Chiu
...
Wei Han
Qiao Liang
Yu Zhang
Trevor Strohman
Yonghui Wu
AuLLM
112
123
0
21 Nov 2020
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Hayato Futami
Hirofumi Inaguma
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
54
52
0
09 Aug 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
241
5,774
0
20 Jun 2020
Iterative Pseudo-Labeling for Speech Recognition
Iterative Pseudo-Labeling for Speech Recognition
Qiantong Xu
Tatiana Likhomanenko
Jacob Kahn
Awni Y. Hannun
Gabriel Synnaeve
R. Collobert
VLM
56
132
0
19 May 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
212
3,119
0
16 May 2020
ContextNet: Improving Convolutional Neural Networks for Automatic Speech
  Recognition with Global Context
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Wei Han
Zhengdong Zhang
Yu Zhang
Jiahui Yu
Chung-Cheng Chiu
James Qin
Anmol Gulati
Ruoming Pang
Yonghui Wu
61
263
0
07 May 2020
Hybrid Autoregressive Transducer (hat)
Hybrid Autoregressive Transducer (hat)
Ehsan Variani
David Rybach
Cyril Allauzen
Michael Riley
53
160
0
12 Mar 2020
A Density Ratio Approach to Language Model Fusion in End-To-End
  Automatic Speech Recognition
A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Erik McDermott
Hasim Sak
Ehsan Variani
52
113
0
26 Feb 2020
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
144
666
0
12 Oct 2019
Two-Pass End-to-End Speech Recognition
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
77
147
0
29 Aug 2019
Learn Spelling from Teachers: Transferring Knowledge from Language
  Models to Sequence-to-Sequence Speech Recognition
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
KELM
60
38
0
13 Jul 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
69
102
0
13 May 2019
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
M. Baskar
Shinji Watanabe
Ramón Fernández Astudillo
Takaaki Hori
L. Burget
J. Černocký
61
40
0
30 Apr 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech
  Recognition
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
57
37
0
05 Feb 2019
Streaming End-to-end Speech Recognition For Mobile Devices
Streaming End-to-end Speech Recognition For Mobile Devices
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
R. Álvarez
...
K. Sim
Tom Bagby
Shuo-yiin Chang
Kanishka Rao
A. Gruenstein
98
626
0
15 Nov 2018
Cycle-consistency training for end-to-end speech recognition
Cycle-consistency training for end-to-end speech recognition
Takaaki Hori
Ramón Fernández Astudillo
Tomoki Hayashi
Yu Zhang
Shinji Watanabe
Jonathan Le Roux
65
87
0
02 Nov 2018
Exploring Architectures, Data and Units For Streaming End-to-End Speech
  Recognition with RNN-Transducer
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer
Kanishka Rao
Hasim Sak
Rohit Prabhavalkar
AI4TS
72
347
0
02 Jan 2018
An analysis of incorporating an external language model into a
  sequence-to-sequence model
An analysis of incorporating an external language model into a sequence-to-sequence model
Anjuli Kannan
Yonghui Wu
Patrick Nguyen
Tara N. Sainath
Zhiwen Chen
Rohit Prabhavalkar
58
246
0
06 Dec 2017
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
86
1,151
0
05 Dec 2017
Cold Fusion: Training Seq2Seq Models Together with Language Models
Cold Fusion: Training Seq2Seq Models Together with Language Models
Anuroop Sriram
Heewoo Jun
S. Satheesh
Adam Coates
VLM
75
281
0
21 Aug 2017
Towards better decoding and language model integration in sequence to
  sequence models
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
71
369
0
08 Dec 2016
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task
  Learning
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Suyoun Kim
Takaaki Hori
Shinji Watanabe
74
925
0
21 Sep 2016
Improving Neural Machine Translation Models with Monolingual Data
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich
Barry Haddow
Alexandra Birch
241
2,716
0
20 Nov 2015
Listen, Attend and Spell
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
147
2,265
0
05 Aug 2015
Attention-Based Models for Speech Recognition
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
117
2,606
0
24 Jun 2015
Speech Recognition with Deep Recurrent Neural Networks
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
210
8,513
0
22 Mar 2013
Sequence Transduction with Recurrent Neural Networks
Sequence Transduction with Recurrent Neural Networks
Alex Graves
181
1,866
0
14 Nov 2012
1