ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.10600
  4. Cited By
Knowledge Transfer and Distillation from Autoregressive to
  Non-Autoregressive Speech Recognition

Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition

15 July 2022
Xun Gong
Zhikai Zhou
Y. Qian
ArXivPDFHTML

Papers citing "Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition"

17 / 17 papers shown
Title
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM
Xun Gong
Anqi Lv
Zhiming Wang
Huijia Zhu
Y. Qian
25
0
0
25 May 2025
Improving non-autoregressive end-to-end speech recognition with
  pre-trained acoustic and language models
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
Keqi Deng
Zehui Yang
Shinji Watanabe
Yosuke Higuchi
Gaofeng Cheng
Pengyuan Zhang
38
23
0
25 Jan 2022
An Improved Single Step Non-autoregressive Transformer for Automatic
  Speech Recognition
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
Abeer Alwan
40
15
0
18 Jun 2021
Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Yosuke Higuchi
Hirofumi Inaguma
Shinji Watanabe
Tetsuji Ogawa
Tetsunori Kobayashi
37
61
0
26 Oct 2020
Align-Refine: Non-Autoregressive Speech Recognition via Iterative
  Realignment
Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Ethan A. Chi
Julian Salazar
Katrin Kirchhoff
AI4TS
45
51
0
24 Oct 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
160
5,677
0
20 Jun 2020
Knowledge Distillation: A Survey
Knowledge Distillation: A Survey
Jianping Gou
B. Yu
Stephen J. Maybank
Dacheng Tao
VLM
46
2,907
0
09 Jun 2020
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke Higuchi
Shinji Watanabe
Nanxin Chen
Tetsuji Ogawa
Tetsunori Kobayashi
34
137
0
18 May 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
188
3,082
0
16 May 2020
SpecAugment: A Simple Data Augmentation Method for Automatic Speech
  Recognition
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
145
3,435
0
18 Apr 2019
Sequence-Level Knowledge Distillation for Model Compression of
  Attention-based Sequence-to-Sequence Speech Recognition
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Muáz Muním
Nakamasa Inoue
Koichi Shinoda
40
26
0
12 Nov 2018
SentencePiece: A simple and language independent subword tokenizer and
  detokenizer for Neural Text Processing
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
142
3,490
0
19 Aug 2018
ESPnet: End-to-End Speech Processing Toolkit
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Sanjeev Khudanpur
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
76
1,492
0
30 Mar 2018
Minimum Word Error Rate Training for Attention-based
  Sequence-to-Sequence Models
Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models
Rohit Prabhavalkar
Tara N. Sainath
Yonghui Wu
Patrick Nguyen
Zhiwen Chen
Chung-Cheng Chiu
Anjuli Kannan
40
161
0
05 Dec 2017
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech
  Recognition Baseline
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline
Hui Bu
Jiayu Du
Xingyu Na
Bengu Wu
Hao Zheng
CVBM
44
832
0
16 Sep 2017
Exploring Neural Transducers for End-to-End Speech Recognition
Exploring Neural Transducers for End-to-End Speech Recognition
Eric Battenberg
Jitong Chen
R. Child
Adam Coates
Yashesh Gaur Yi Li
...
Hairong Liu
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
AI4TS
59
230
0
24 Jul 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
427
129,831
0
12 Jun 2017
1