Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.16107
Cited By
Decoder-only Architecture for Streaming End-to-end Speech Recognition
23 June 2024
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
RALM
AuLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Decoder-only Architecture for Streaming End-to-end Speech Recognition"
10 / 10 papers shown
Title
Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
VLM
AuLLM
RALM
48
9
0
16 Sep 2023
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
51
4
0
24 Jul 2023
Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding
Siddhant Arora
Hayato Futami
Yosuke Kashiwagi
E. Tsunoo
Brian Yan
Shinji Watanabe
46
4
0
20 Jul 2023
BASS: Block-wise Adaptation for Speech Summarization
Roshan S. Sharma
Kenneth Zheng
Siddhant Arora
Shinji Watanabe
Rita Singh
Bhiksha Raj
48
7
0
17 Jul 2023
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
Zhuoyuan Yao
Di Wu
Xiong Wang
Binbin Zhang
Fan Yu
Chao Yang
Zhendong Peng
Xiaoyu Chen
Lei Xie
X. Lei
44
265
0
02 Feb 2021
Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition
Wei Zhou
Simon Berger
Ralf Schluter
Hermann Ney
36
33
0
30 Oct 2020
Transformer ASR with Contextual Block Processing
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
66
64
0
16 Oct 2019
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
52
147
0
29 Aug 2019
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping
Linhao Dong
Feng Wang
Bo Xu
41
90
0
18 Feb 2019
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding
Yajie Miao
M. Gowayyed
Florian Metze
73
753
0
29 Jul 2015
1