ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.06699
  4. Cited By
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for
  Low-resource Speech Recognition

Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition

17 January 2021
Cheng Yi
Shiyu Zhou
Bo Xu
ArXivPDFHTML

Papers citing "Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition"

25 / 25 papers shown
Title
Applying Wav2vec2.0 to Speech Recognition in Various Low-resource
  Languages
Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages
Cheng Yi
Jianzhong Wang
Ning Cheng
Shiyu Zhou
Bo Xu
SSL
VLM
64
82
0
22 Dec 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained
  Models
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
44
6
0
24 Oct 2020
Unsupervised Cross-lingual Representation Learning for Speech
  Recognition
Unsupervised Cross-lingual Representation Learning for Speech Recognition
Alexis Conneau
Alexei Baevski
R. Collobert
Abdel-rahman Mohamed
Michael Auli
SSL
142
780
0
24 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
275
5,783
0
20 Jun 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End
  Models for Speech Recognition
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition
Linhao Dong
Cheng Yi
Jianzong Wang
Shiyu Zhou
Shuang Xu
X. Jia
Bo Xu
44
17
0
20 May 2020
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke Higuchi
Shinji Watanabe
Nanxin Chen
Tetsuji Ogawa
Tetsunori Kobayashi
44
138
0
18 May 2020
Semi-supervised ASR by End-to-end Self-training
Semi-supervised ASR by End-to-end Self-training
Yang Chen
Weiran Wang
Chao Wang
57
53
0
24 Jan 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
111
1,808
0
22 Jan 2020
Improving Transformer-based Speech Recognition Using Unsupervised
  Pre-training
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
48
99
0
22 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
150
666
0
12 Oct 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
351
6,448
0
26 Sep 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
227
8,424
0
19 Jun 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
69
128
0
27 May 2019
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
M. Baskar
Shinji Watanabe
Ramón Fernández Astudillo
Takaaki Hori
L. Burget
J. Černocký
61
40
0
30 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
101
3,150
0
01 Apr 2019
Adversarial Training of End-to-end Speech Recognition Using a
  Criticizing Language Model
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
52
47
0
02 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.7K
94,729
0
11 Oct 2018
A Comparison of Techniques for Language Model Integration in
  Encoder-Decoder Speech Recognition
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Shubham Toshniwal
Anjuli Kannan
Chung-Cheng Chiu
Yonghui Wu
Tara N. Sainath
Karen Livescu
67
156
0
27 Jul 2018
Multilingual End-to-End Speech Recognition with A Single Transformer on
  Low-Resource Languages
Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages
Shiyu Zhou
Shuang Xu
Bo Xu
56
73
0
12 Jun 2018
A Comparison of Modeling Units in Sequence-to-Sequence Speech
  Recognition with the Transformer on Mandarin Chinese
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese
Shiyu Zhou
Linhao Dong
Shuang Xu
Bo Xu
55
63
0
16 May 2018
Supervised and Unsupervised Transfer Learning for Question Answering
Supervised and Unsupervised Transfer Learning for Question Answering
Yu-An Chung
Hung-yi Lee
James R. Glass
64
83
0
14 Nov 2017
Non-Autoregressive Neural Machine Translation
Non-Autoregressive Neural Machine Translation
Jiatao Gu
James Bradbury
Caiming Xiong
Victor O.K. Li
R. Socher
97
795
0
07 Nov 2017
Cold Fusion: Training Seq2Seq Models Together with Language Models
Cold Fusion: Training Seq2Seq Models Together with Language Models
Anuroop Sriram
Heewoo Jun
S. Satheesh
Adam Coates
VLM
75
281
0
21 Aug 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
682
131,414
0
12 Jun 2017
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task
  Learning
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Suyoun Kim
Takaaki Hori
Shinji Watanabe
76
928
0
21 Sep 2016
1