Joint Speech Recognition and Speaker Diarization via Sequence
Transduction

Joint Speech Recognition and Speaker Diarization via Sequence Transduction

9 July 2019

Laurent El Shafey

Papers citing "Joint Speech Recognition and Speaker Diarization via Sequence Transduction"

13 / 63 papers shown

Title
Streaming Multi-speaker ASR with RNN-T Ilya Sklyar A. Piunova Yulan Liu 17 36 0 23 Nov 2020
Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop M. Marge C. Espy-Wilson Roger K. Moore 26 78 0 11 Nov 2020
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR Naoyuki Kanda Zhong Meng Liang Lu Yashesh Gaur Xiaofei Wang Zhuo Chen Takuya Yoshioka 20 17 0 03 Nov 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge Renyu Wang Ruilin Tong Y. Yeung Xiao Chen 6 1 0 22 Oct 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings Naoyuki Kanda Xuankai Chang Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Takuya Yoshioka 12 48 0 11 Aug 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers Naoyuki Kanda Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Tianyan Zhou Takuya Yoshioka 6 74 0 19 Jun 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations H. H. Mao Shuyang Li Julian McAuley G. Cottrell VLM 22 40 0 16 May 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition Naoyuki Kanda Yashesh Gaur Xiaofei Wang Zhong Meng Takuya Yoshioka 6 113 0 28 Mar 2020
The Medical Scribe: Corpus Development and Model Performance Analyses Izhak Shafran Nan Du Linh Tran Amanda N. Perry Lauren Keyes ... Gang Li Mingqiu Wang Laurent El Shafey H. Soltau Justin S. Paul 6 14 0 12 Mar 2020
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition Shuai Zhang Jiangyan Yi Zhengkun Tian J. Tao Ye Bai 25 25 0 19 Feb 2020
Multi-task Learning for Speaker Verification and Voice Trigger Detection Siddharth Sigtia Erik Marchi S. Kajarekar Devang Naik J. Bridle 40 29 0 26 Jan 2020
Linguistically Aided Speaker Diarization Using Speaker Role Information Nikolaos Flemotomos P. Georgiou Shrikanth Narayanan 25 2 0 18 Nov 2019
VoxCeleb2: Deep Speaker Recognition Joon Son Chung Arsha Nagrani Andrew Zisserman 257 2,233 0 14 Jun 2018