2204.03939
GigaST: A 10,000-hour Pseudo Speech Translation Corpus
8 April 2022
Rong Ye
Chengqi Zhao
Tom Ko
Chutong Meng
Tao Wang
Mingxuan Wang
Jun Cao
Papers cited by
"GigaST: A 10,000-hour Pseudo Speech Translation Corpus"
20 / 20 papers shown
Title
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
127
2,879
0
14 Jun 2021
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Guoguo Chen
Shuzhou Chai
Guan-Bo Wang
Jiayu Du
Weiqiang Zhang
...
Xuchen Yao
Yongqing Wang
Yujun Wang
Zhao You
Zhiyong Yan
81
360
0
13 Jun 2021
SUPERB: Speech processing Universal PERformance Benchmark
Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Jeff Lai
Kushal Lakhotia
...
Shuyan Dong
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
SSL
78
910
0
03 May 2021
The Multilingual TEDx Corpus for Speech Recognition and Translation
Elizabeth Salesky
Sanjeev Khudanpur
Jacob Bremerman
R. Cattoni
Matteo Negri
Marco Turchi
Douglas W. Oard
Matt Post
35
121
0
02 Feb 2021
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
52
32
0
18 Dec 2020
The Volctrans Machine Translation System for WMT20
Liwei Wu
Xiao Pan
Zehui Lin
Yaoming Zhu
Mingxuan Wang
Lei Li
VLM
22
17
0
28 Oct 2020
fairseq S2T: Fast Speech-to-Text Modeling with fairseq
Changhan Wang
Yun Tang
Xutai Ma
Anne Wu
Sravya Popuri
Dmytro Okhonko
J. Pino
VLM
LRM
51
267
0
11 Oct 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
160
5,677
0
20 Jun 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
63
161
0
21 Apr 2020
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
145
3,435
0
18 Apr 2019
End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu
Hao Xiong
Zhongjun He
Jiajun Zhang
Hua Wu
Haifeng Wang
Chengqing Zong
63
153
0
17 Apr 2019
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Ye Jia
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung-Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
52
159
0
05 Nov 2018
Has Machine Translation Achieved Human Parity? A Case for Document-level Evaluation
Samuel Läubli
Rico Sennrich
M. Volk
30
257
0
21 Aug 2018
Evaluating Discourse Phenomena in Neural Machine Translation
Rachel Bawden
Rico Sennrich
Alexandra Birch
Barry Haddow
50
262
0
01 Nov 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
430
129,831
0
12 Jun 2017
Ensemble Distillation for Neural Machine Translation
Markus Freitag
Yaser Al-Onaizan
B. Sankaran
FedML
35
111
0
06 Feb 2017
Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation
Alexandre Berard
Olivier Pietquin
Christophe Servan
Laurent Besacier
65
314
0
06 Dec 2016
Sequence-Level Knowledge Distillation
Yoon Kim
Alexander M. Rush
81
1,109
0
25 Jun 2016
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich
Barry Haddow
Alexandra Birch
193
2,705
0
20 Nov 2015
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
153
7,683
0
31 Aug 2015