Improving speech translation by fusing speech and text

23 May 2023

Papers citing "Improving speech translation by fusing speech and text"

21 / 21 papers shown

Title
Cross-modal Contrastive Learning for Speech Translation Rong Ye Mingxuan Wang Lei Li SSL 52 90 0 05 May 2022
GigaST: A 10,000-hour Pseudo Speech Translation Corpus Rong Ye Chengqi Zhao Tom Ko Chutong Meng Tao Wang Mingxuan Wang Jun Cao 34 23 0 08 Apr 2022
STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation Qingkai Fang Rong Ye Lei Li Yang Feng Mingxuan Wang 81 99 0 20 Mar 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Sanyuan Chen Chengyi Wang Zhengyang Chen Yu-Huan Wu Shujie Liu ... Yao Qian Jian Wu Micheal Zeng Xiangzhan Yu Furu Wei SSL 239 1,857 0 26 Oct 2021
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task Yun Tang J. Pino Xian Li Changhan Wang Dmitriy Genzel 146 84 0 12 Jul 2021
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders Chen Xu Bojie Hu Yanyang Li Yuhao Zhang Shen Huang Qi Ju Tong Xiao Jingbo Zhu 59 78 0 12 May 2021
End-to-end Speech Translation via Cross-modal Progressive Training Rong Ye Mingxuan Wang Lei Li 55 73 0 21 Apr 2021
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning Wei Li Can Gao Guocheng Niu Xinyan Xiao Hao Liu Jiachen Liu Hua Wu Haifeng Wang 91 378 0 31 Dec 2020
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation Hang Le J. Pino Changhan Wang Jiatao Gu D. Schwab Laurent Besacier 82 82 0 02 Nov 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 282 5,790 0 20 Jun 2020
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020 Marco Gaido Mattia Antonino Di Gangi Matteo Negri Marco Turchi 69 54 0 04 Jun 2020
Self-Training for End-to-End Speech Translation J. Pino Qiantong Xu Xutai Ma M. Dousti Yun Tang 67 60 0 03 Jun 2020
On Using SpecAugment for End-to-End Speech Translation Parnia Bahar Albert Zeyer Ralf Schluter Hermann Ney 57 54 0 20 Nov 2019
Self-Attentional Models for Lattice Inputs Matthias Sperber Graham Neubig Ngoc-Quan Pham A. Waibel 55 44 0 04 Jun 2019
Fluent Translations from Disfluent Speech in End-to-End Speech Translation Elizabeth Salesky Matthias Sperber A. Waibel 56 34 0 03 Jun 2019
End-to-End Speech Translation with Knowledge Distillation Yuchen Liu Hao Xiong Zhongjun He Jiajun Zhang Hua Wu Haifeng Wang Chengqing Zong 76 155 0 17 Apr 2019
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing Taku Kudo John Richardson 196 3,518 0 19 Aug 2018
A Call for Clarity in Reporting BLEU Scores Matt Post 145 2,985 0 23 Apr 2018
End-to-End Automatic Speech Translation of Audiobooks Alexandre Berard Laurent Besacier A. Kocabiyikoglu Olivier Pietquin 110 192 0 12 Feb 2018
Sequence-to-Sequence Models Can Directly Translate Foreign Speech Ron J. Weiss J. Chorowski Navdeep Jaitly Yonghui Wu Zhiwen Chen 79 344 0 24 Mar 2017
Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation Alexandre Berard Olivier Pietquin Christophe Servan Laurent Besacier 70 319 0 06 Dec 2016