Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.05337
Cited By
Joint Speech Recognition and Speaker Diarization via Sequence Transduction
9 July 2019
Laurent El Shafey
H. Soltau
Izhak Shafran
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Joint Speech Recognition and Speaker Diarization via Sequence Transduction"
13 / 63 papers shown
Title
Streaming Multi-speaker ASR with RNN-T
Ilya Sklyar
A. Piunova
Yulan Liu
17
36
0
23 Nov 2020
Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop
M. Marge
C. Espy-Wilson
Roger K. Moore
26
78
0
11 Nov 2020
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Zhong Meng
Liang Lu
Yashesh Gaur
Xiaofei Wang
Zhuo Chen
Takuya Yoshioka
20
17
0
03 Nov 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge
Renyu Wang
Ruilin Tong
Y. Yeung
Xiao Chen
6
1
0
22 Oct 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
12
48
0
11 Aug 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Tianyan Zhou
Takuya Yoshioka
6
74
0
19 Jun 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations
H. H. Mao
Shuyang Li
Julian McAuley
G. Cottrell
VLM
22
40
0
16 May 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
6
113
0
28 Mar 2020
The Medical Scribe: Corpus Development and Model Performance Analyses
Izhak Shafran
Nan Du
Linh Tran
Amanda N. Perry
Lauren Keyes
...
Gang Li
Mingqiu Wang
Laurent El Shafey
H. Soltau
Justin S. Paul
6
14
0
12 Mar 2020
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition
Shuai Zhang
Jiangyan Yi
Zhengkun Tian
J. Tao
Ye Bai
25
25
0
19 Feb 2020
Multi-task Learning for Speaker Verification and Voice Trigger Detection
Siddharth Sigtia
Erik Marchi
S. Kajarekar
Devang Naik
J. Bridle
40
29
0
26 Jan 2020
Linguistically Aided Speaker Diarization Using Speaker Role Information
Nikolaos Flemotomos
P. Georgiou
Shrikanth Narayanan
25
2
0
18 Nov 2019
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
257
2,233
0
14 Jun 2018
Previous
1
2