ResearchTrend.AI
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization

18 May 2023
Puyuan Peng
Brian Yan
Shinji Watanabe
David Harwath
VLM, LRM
arXiv: 2305.11095 | GitHub (144★)

Papers citing "Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization"

11 / 11 papers shown
A Self-Refining Framework for Enhancing ASR Using TTS-Synthesized Data
Cheng-Kang Chou
Chan-Jan Hsu
Ho-Lam Chung
Liang-Hsuan Tseng
H. Cheng
Yu-Kuan Fu
Kuan Po Huang
Hung-yi Lee
64
0
0
10 Jun 2025
Improving Code Switching with Supervised Fine Tuning and GELU Adapters
Linh Pham
38
0
0
30 May 2025
Languages in Multilingual Speech Foundation Models Align Both Phonetically and Semantically
Ryan Soh-Eun Shim
Domenico De Cristofaro
Chengzhi Hu
Alessandro Vietti
Barbara Plank
66
0
0
26 May 2025
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides
Jinghua Zhao
Yuhang Jia
Shiyao Wang
Jiaming Zhou
Hui Wang
Yong Qin
115
0
0
21 Apr 2025
Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding
Jiahui Zhao
Hao Shi
Chenrui Cui
Tianrui Wang
Hexin Liu
Zhaoheng Ni
Lingxuan Ye
Longbiao Wang
199
1
0
21 Dec 2024
Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation
Tiia Sildam
Andra Velve
Tanel Alumäe
114
0
0
04 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark Gales
Kate Knill
207
5
0
01 Jul 2024
Generative error correction for code-switching speech recognition using large language models
Chen Chen
Yuchen Hu
Chao-Han Huck Yang
Hexin Liu
Sabato Marco Siniscalchi
Chng Eng Siong
72
9
0
17 Oct 2023
Instruction-Following Speech Recognition
Cheng-I Jeff Lai
Zhiyun Lu
Liangliang Cao
Ruoming Pang
AuLLM
99
6
0
18 Sep 2023
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Chien-yu Huang
Ke-Han Lu
Shi Wang
Chi-Yuan Hsiao
Chun-Yi Kuan
...
Roshan S. Sharma
Shinji Watanabe
Bhiksha Ramakrishnan
Shady Shehata
Hung-yi Lee
AuLLM
94
63
0
18 Sep 2023
Can Whisper perform speech-based in-context learning?
Siyin Wang
Chao-Han Huck Yang
Ji Wu
Chao Zhang
119
30
0
13 Sep 2023