Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.08757
Cited By
Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation
16 March 2022
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation"
12 / 12 papers shown
Title
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
176
0
0
28 Jan 2025
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison
Tsz Kin Lam
Marco Gaido
Sara Papi
L. Bentivogli
Barry Haddow
36
0
0
04 Jan 2025
UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language Understanding
Dongyang Li
Taolin Zhang
Jiali Deng
Longtao Huang
Chengyu Wang
Xiaofeng He
Hui Xue
34
1
0
24 Jun 2024
Recent Advances in Direct Speech-to-text Translation
Chen Xu
Rong Ye
Qianqian Dong
Chengqi Zhao
Tom Ko
Mingxuan Wang
Tong Xiao
Jingbo Zhu
21
18
0
20 Jun 2023
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Chenyang Le
Yao Qian
Long Zhou
Shujie Liu
Yanmin Qian
Michael Zeng
Xuedong Huang
24
13
0
24 May 2023
Back Translation for Speech-to-text Translation Without Transcripts
Qingkai Fang
Yang Feng
38
13
0
15 May 2023
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
41
6
0
19 Dec 2022
WACO: Word-Aligned Contrastive Learning for Speech Translation
Siqi Ouyang
Rong Ye
Lei Li
32
25
0
19 Dec 2022
M3ST: Mix at Three Levels for Speech Translation
Xuxin Cheng
Qianqian Dong
Fengpeng Yue
Tom Ko
Mingxuan Wang
Yuexian Zou
30
40
0
07 Dec 2022
Efficient Speech Translation with Pre-trained Models
Zhaolin Li
Jan Niehues
27
2
0
09 Nov 2022
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
VLM
42
8
0
27 Oct 2022
Joint Speech Translation and Named Entity Recognition
Marco Gaido
Sara Papi
Matteo Negri
Marco Turchi
33
3
0
21 Oct 2022
1