Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.12231
Cited By
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
27 February 2020
Arya D. McCarthy
Liezl Puzon
J. Pino
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation"
17 / 17 papers shown
Title
Recent Advances in Direct Speech-to-text Translation
Chen Xu
Rong Ye
Qianqian Dong
Chengqi Zhao
Tom Ko
Mingxuan Wang
Tong Xiao
Jingbo Zhu
19
18
0
20 Jun 2023
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Biao Fu
Minpeng Liao
Kai Fan
Zhongqiang Huang
Boxing Chen
Yidong Chen
Xiaodon Shi
48
8
0
14 Mar 2023
Improved Long-Form Spoken Language Translation with Large Language Models
Arya D. McCarthy
Haotong Zhang
Shankar Kumar
Felix Stahlberg
Axel H. Ng
13
2
0
19 Dec 2022
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
41
6
0
19 Dec 2022
WACO: Word-Aligned Contrastive Learning for Speech Translation
Siqi Ouyang
Rong Ye
Lei Li
32
25
0
19 Dec 2022
M3ST: Mix at Three Levels for Speech Translation
Xuxin Cheng
Qianqian Dong
Fengpeng Yue
Tom Ko
Mingxuan Wang
Yuexian Zou
30
40
0
07 Dec 2022
Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Qianqian Dong
Fengpeng Yue
Tom Ko
Mingxuan Wang
Qibing Bai
Yu Zhang
32
16
0
18 May 2022
Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition
Zengrui Jin
Mengzhe Geng
Jiajun Deng
Tianzi Wang
Shujie Hu
Guinan Li
Xunying Liu
19
19
0
13 May 2022
Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
17
32
0
16 Mar 2022
Learning When to Translate for Streaming Speech
Qianqian Dong
Yaoming Zhu
Mingxuan Wang
Lei Li
52
29
0
15 Sep 2021
Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring
Yaman Kumar Singla
Avykat Gupta
Shaurya Bagga
Changyou Chen
Balaji Krishnamurthy
R. Shah
29
12
0
30 Aug 2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation
Ye Jia
Michelle Tadmor Ramanovich
Tal Remez
Roi Pomerantz
26
67
0
19 Jul 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Changhan Wang
Anne Wu
J. Pino
Alexei Baevski
Michael Auli
Alexis Conneau
SSL
31
44
0
14 Apr 2021
Tight Integrated End-to-End Training for Cascaded Speech Translation
Parnia Bahar
Tobias Bieschke
Ralf Schluter
Hermann Ney
37
26
0
24 Nov 2020
Self-Supervised Representations Improve End-to-End Speech Translation
Anne Wu
Changhan Wang
J. Pino
Jiatao Gu
SSL
25
40
0
22 Jun 2020
Unsupervised Morphological Paradigm Completion
Huiming Jin
Liwei Cai
Yihui Peng
Chen Xia
Arya D. McCarthy
Katharina Kann
16
27
0
03 May 2020
End-to-End Automatic Speech Translation of Audiobooks
Alexandre Berard
Laurent Besacier
A. Kocabiyikoglu
Olivier Pietquin
75
190
0
12 Feb 2018
1