Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.11095
Cited By
v1
v2
v3 (latest)
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
18 May 2023
Puyuan Peng
Brian Yan
Shinji Watanabe
David Harwath
VLM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (144★)
Papers citing
"Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization"
11 / 11 papers shown
Title
A Self-Refining Framework for Enhancing ASR Using TTS-Synthesized Data
Cheng-Kang Chou
Chan-Jan Hsu
Ho-Lam Chung
Liang-Hsuan Tseng
H. Cheng
Yu-Kuan Fu
Kuan Po Huang
Hung-yi Lee
64
0
0
10 Jun 2025
Improving Code Switching with Supervised Fine Tuning and GELU Adapters
Linh Pham
38
0
0
30 May 2025
Languages in Multilingual Speech Foundation Models Align Both Phonetically and Semantically
Ryan Soh-Eun Shim
Domenico De Cristofaro
Chengzhi Hu
Alessandro Vietti
Barbara Plank
66
0
0
26 May 2025
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides
Jinghua Zhao
Yuhang Jia
Shiyao Wang
Jiaming Zhou
Hui Wang
Yong Qin
115
0
0
21 Apr 2025
Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding
Jiahui Zhao
Hao Shi
Chenrui Cui
Tianrui Wang
Hexin Liu
Zhaoheng Ni
Lingxuan Ye
Longbiao Wang
199
1
0
21 Dec 2024
Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation
Tiia Sildam
Andra Velve
Tanel Alumäe
114
0
0
04 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark Gales
Kate Knill
207
5
0
01 Jul 2024
Generative error correction for code-switching speech recognition using large language models
Chen Chen
Yuchen Hu
Chao-Han Huck Yang
Hexin Liu
Sabato Marco Siniscalchi
Chng Eng Siong
72
9
0
17 Oct 2023
Instruction-Following Speech Recognition
Cheng-I Jeff Lai
Zhiyun Lu
Liangliang Cao
Ruoming Pang
AuLLM
99
6
0
18 Sep 2023
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Chien-yu Huang
Ke-Han Lu
Shi Wang
Chi-Yuan Hsiao
Chun-Yi Kuan
...
Roshan S. Sharma
Shinji Watanabe
Bhiksha Ramakrishnan
Shady Shehata
Hung-yi Lee
AuLLM
94
63
0
18 Sep 2023
Can Whisper perform speech-based in-context learning?
Siyin Wang
Chao-Han Huck Yang
Ji Wu
Chao Zhang
119
30
0
13 Sep 2023
1