Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.12687
Cited By
Serialized Output Training for End-to-End Overlapped Speech Recognition
28 March 2020
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Serialized Output Training for End-to-End Overlapped Speech Recognition"
10 / 10 papers shown
Title
Target Speaker ASR with Whisper
Alexander Polok
Dominik Klement
Sanjeev Khudanpur
Kevin Duh
J. Černocký
L. Burget
139
2
0
17 Jan 2025
Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC
Jiawen Kang
Lingwei Meng
Mingyu Cui
Yuejiao Wang
Xixin Wu
Xunying Liu
Helen Meng
74
2
0
19 Sep 2024
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Lingwei Meng
Shujie Hu
Jiawen Kang
Zhaoqing Li
Yuejiao Wang
Wenxuan Wu
Xixin Wu
Xunying Liu
Helen Meng
AuLLM
119
3
0
13 Sep 2024
Speech Recognition with Augmented Synthesized Speech
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Ye Jia
Pedro J. Moreno
Yonghui Wu
Zelin Wu
59
127
0
25 Sep 2019
Joint Speech Recognition and Speaker Diarization via Sequence Transduction
Laurent El Shafey
H. Soltau
Izhak Shafran
55
99
0
09 Jul 2019
A Purely End-to-end System for Multi-speaker Speech Recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
J. Hershey
42
86
0
15 May 2018
English Conversational Telephone Speech Recognition by Humans and Machines
G. Saon
Gakuto Kurata
Tom Sercu
Kartik Audhkhasi
Samuel Thomas
...
Bhuvana Ramabhadran
M. Picheny
L. Lim
Bergul Roomi
Phil Hall
60
365
0
06 Mar 2017
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
307
10,412
0
21 Jul 2016
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
147
2,261
0
05 Aug 2015
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
110
2,605
0
24 Jun 2015
1