Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.06699
Cited By
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition
17 January 2021
Cheng Yi
Shiyu Zhou
Bo Xu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition"
25 / 25 papers shown
Title
Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages
Cheng Yi
Jianzhong Wang
Ning Cheng
Shiyu Zhou
Bo Xu
SSL
VLM
64
82
0
22 Dec 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
44
6
0
24 Oct 2020
Unsupervised Cross-lingual Representation Learning for Speech Recognition
Alexis Conneau
Alexei Baevski
R. Collobert
Abdel-rahman Mohamed
Michael Auli
SSL
142
780
0
24 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
275
5,783
0
20 Jun 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition
Linhao Dong
Cheng Yi
Jianzong Wang
Shiyu Zhou
Shuang Xu
X. Jia
Bo Xu
44
17
0
20 May 2020
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke Higuchi
Shinji Watanabe
Nanxin Chen
Tetsuji Ogawa
Tetsunori Kobayashi
44
138
0
18 May 2020
Semi-supervised ASR by End-to-end Self-training
Yang Chen
Weiran Wang
Chao Wang
57
53
0
24 Jan 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
111
1,808
0
22 Jan 2020
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
48
99
0
22 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
150
666
0
12 Oct 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
351
6,448
0
26 Sep 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
227
8,424
0
19 Jun 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
69
128
0
27 May 2019
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
M. Baskar
Shinji Watanabe
Ramón Fernández Astudillo
Takaaki Hori
L. Burget
J. Černocký
61
40
0
30 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
101
3,150
0
01 Apr 2019
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
52
47
0
02 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.7K
94,729
0
11 Oct 2018
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Shubham Toshniwal
Anjuli Kannan
Chung-Cheng Chiu
Yonghui Wu
Tara N. Sainath
Karen Livescu
67
156
0
27 Jul 2018
Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages
Shiyu Zhou
Shuang Xu
Bo Xu
56
73
0
12 Jun 2018
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese
Shiyu Zhou
Linhao Dong
Shuang Xu
Bo Xu
55
63
0
16 May 2018
Supervised and Unsupervised Transfer Learning for Question Answering
Yu-An Chung
Hung-yi Lee
James R. Glass
64
83
0
14 Nov 2017
Non-Autoregressive Neural Machine Translation
Jiatao Gu
James Bradbury
Caiming Xiong
Victor O.K. Li
R. Socher
97
795
0
07 Nov 2017
Cold Fusion: Training Seq2Seq Models Together with Language Models
Anuroop Sriram
Heewoo Jun
S. Satheesh
Adam Coates
VLM
75
281
0
21 Aug 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
682
131,414
0
12 Jun 2017
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Suyoun Kim
Takaaki Hori
Shinji Watanabe
76
928
0
21 Sep 2016
1