Continuous Streaming Multi-Talker ASR with Dual-path Transducers

17 September 2021

Papers citing "Continuous Streaming Multi-Talker ASR with Dual-path Transducers"

18 / 18 papers shown

Title
Alignment-Free Training for Transducer-based Multi-Talker ASR Takafumi Moriya Shota Horiguchi Marc Delcroix Ryo Masumura Takanori Ashihara Hiroshi Sato Kohei Matsuura Masato Mimura 39 2 0 30 Sep 2024
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation Ye Bai Chenxing Li Hao Li Yuanyuan Zhao Xiaorui Wang 24 0 0 17 Apr 2024
On Speaker Attribution with SURT Desh Raj Matthew Wiesner Matthew Maciejewski Leibny Paola García-Perera Daniel Povey Sanjeev Khudanpur 32 3 0 28 Jan 2024
EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings Sung Hwan Mun Mingrui Han Canyeong Moon Nam Soo Kim 42 1 0 11 Dec 2023
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation Juan Pablo Zuluaga Zhaocheng Huang Xing Niu Rohit Paturi S. Srinivasan Prashant Mathur Brian Thompson Marcello Federico BDL 35 2 0 01 Nov 2023
One model to rule them all? Towards End-to-End Joint Speaker Diarization and Speech Recognition Samuele Cornell Jee-weon Jung Shinji Watanabe S. Squartini VLM 32 16 0 02 Oct 2023
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability Jian Wu Naoyuki Kanda Takuya Yoshioka Rui Zhao Zhuo Chen Jinyu Li 21 5 0 15 Sep 2023
MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems Thilo von Neumann Christoph Boeddeker Marc Delcroix Reinhold Haeb-Umbach 29 16 0 21 Jul 2023
Cascaded encoders for fine-tuning ASR models on overlapped speech R. Rose Oscar Chang Olivier Siohan 15 1 0 28 Jun 2023
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition Desh Raj Daniel Povey Sanjeev Khudanpur VLM 31 9 0 18 Jun 2023
On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems Thilo von Neumann Christoph Boeddeker K. Kinoshita Marc Delcroix Reinhold Haeb-Umbach 37 19 0 29 Nov 2022
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech Ilya Sklyar A. Piunova Christian Osendorfer 11 6 0 10 May 2022
End-to-end multi-talker audio-visual ASR using an active speaker attention module R. Rose Olivier Siohan 13 3 0 01 Apr 2022
Streaming Multi-Talker ASR with Token-Level Serialized Output Training Naoyuki Kanda Jian Wu Yu Wu Xiong Xiao Zhong Meng Xiaofei Wang Yashesh Gaur Zhuo Chen Jinyu Li Takuya Yoshioka 34 54 0 02 Feb 2022
Endpoint Detection for Streaming End-to-End Multi-talker ASR Liang Lu Jinyu Li Yifan Gong 17 17 0 24 Jan 2022
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech Rohit Paturi S. Srinivasan Katrin Kirchhoff Daniel Garcia-Romero 17 9 0 10 Dec 2021
Recent Advances in End-to-End Automatic Speech Recognition Jinyu Li VLM 35 363 0 02 Nov 2021
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation Jing-jing Chen Qi-rong Mao Dong Liu 62 280 0 28 Jul 2020