2307.11005
Cited By
Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding
20 July 2023
Siddhant Arora
Hayato Futami
Yosuke Kashiwagi
E. Tsunoo
Brian Yan
Shinji Watanabe
Papers citing "Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding"
5 / 5 papers shown
Efficient Streaming LLM for Speech Recognition
J. Jia
Gil Keren
Wei Zhou
Egor Lakomkin
Xiaohui Zhang
Chunyang Wu
Frank Seide
Jay Mahadeokar
Ozlem Kalinli
AuLLM
27
0
0
02 Oct 2024
Decoder-only Architecture for Streaming End-to-end Speech Recognition
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
RALM
AuLLM
36
6
0
23 Jun 2024
Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
VLM
AuLLM
RALM
40
9
0
16 Sep 2023
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Chan-Jan Hsu
Ho-Lam Chung
Hung-yi Lee
Yu Tsao
21
6
0
01 Nov 2022
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Siddharth Dalmia
Brian Yan
Vikas Raunak
Florian Metze
Shinji Watanabe
37
30
0
02 May 2021