Recent Advances in Direct Speech-to-text Translation

20 June 2023
Chen Xu
Rong Ye
Qianqian Dong
Chengqi Zhao
Tom Ko
Mingxuan Wang
Tong Xiao
Jingbo Zhu

Papers citing "Recent Advances in Direct Speech-to-text Translation"

15 / 15 papers shown
Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation
Qiuming Zhao
Guangzhi Sun
Chao Zhang
Mingxing Xu
Thomas Fang Zheng
MoMe
VLM
196
0
0
24 Feb 2025
Audio-Language Models for Audio-Centric Tasks: A survey
Yi Su
Jisheng Bai
Qisheng Xu
Kele Xu
Yong Dou
AuLLM
99
2
0
28 Jan 2025
A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation
Xiaoqian Liu
Yangfan Du
Jiadong Wang
Yuan Ge
Chen Xu
Tong Xiao
Guocheng Chen
Jingbo Zhu
36
0
0
31 Dec 2024
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
Xi Chen
Songyang Zhang
Qibing Bai
Kai-xiang Chen
Satoshi Nakamura
AuLLM
35
6
0
22 Jul 2024
Revisiting Interpolation Augmentation for Speech-to-Text Generation
Chen Xu
Jie Wang
Xiaoqian Liu
Qianqian Dong
Chunliang Zhang
Tong Xiao
Jingbo Zhu
Dapeng Man
Wu Yang
43
1
0
22 Jun 2024
SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
Marco Gaido
Sara Papi
Matteo Negri
Mauro Cettolo
L. Bentivogli
43
1
0
17 May 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu
Chen Chen
Chao-Han Huck Yang
Ruizhe Li
Dong Zhang
Zhehuai Chen
E. Chng
20
21
0
10 Feb 2024
End-to-End Speech-to-Text Translation: A Survey
Nivedita Sethiya
Chandresh Kumar Maurya
32
7
0
02 Dec 2023
Rethinking and Improving Multi-task Learning for End-to-end Speech Translation
Yuhao Zhang
Chen Xu
Bei Li
Hao Chen
Tong Xiao
Chunliang Zhang
Jingbo Zhu
26
5
0
07 Nov 2023
Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition
Chen Xu
Xiaoqian Liu
Erfeng He
Yuhao Zhang
Qianqian Dong
Tong Xiao
Jingbo Zhu
Dapeng Man
Wu Yang
35
0
0
21 Sep 2023
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
118
193
0
14 Oct 2021
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates
Hirofumi Inaguma
Siddharth Dalmia
Brian Yan
Shinji Watanabe
65
11
0
27 Sep 2021
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task
Yun Tang
J. Pino
Xian Li
Changhan Wang
Dmitriy Genzel
114
81
0
12 Jul 2021
Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation
Marco Gaido
Matteo Negri
Mauro Cettolo
Marco Turchi
VLM
58
25
0
23 Apr 2021
Tied Multitask Learning for Neural Speech Translation
Antonios Anastasopoulos
David Chiang
102
172
0
19 Feb 2018