v1v2v3 (latest)

Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech

10 December 2021

Papers citing "Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech"

28 / 28 papers shown

Title
Continuous Streaming Multi-Talker ASR with Dual-path Transducers Desh Raj Liang Lu Zhuo Chen Yashesh Gaur Jinyu Li 50 18 0 17 Sep 2021
End-to-End Speaker-Attributed ASR with Transformer Naoyuki Kanda Guoli Ye Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Takuya Yoshioka 54 49 0 05 Apr 2021
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings Xuankai Chang Naoyuki Kanda Yashesh Gaur Xiaofei Wang Zhong Meng Takuya Yoshioka RALM 53 15 0 06 Jan 2021
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording Cong Han Yi Luo Chenda Li Tianyan Zhou K. Kinoshita ... Marc Delcroix Hakan Erdogan J. Hershey N. Mesgarani Zhuo Chen 58 8 0 17 Dec 2020
Attention is All You Need in Speech Separation Cem Subakan Mirco Ravanelli Samuele Cornell Mirko Bronzi Jianyuan Zhong 95 561 0 25 Oct 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings Naoyuki Kanda Xuankai Chang Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Takuya Yoshioka 52 49 0 11 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation Jing-jing Chen Qi-rong Mao Dong Liu 85 287 0 28 Jul 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers Naoyuki Kanda Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Tianyan Zhou Takuya Yoshioka 53 78 0 19 Jun 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 68 155 0 08 May 2020
In defence of metric learning for speaker recognition Joon Son Chung Jaesung Huh Seongkyu Mun Minjae Lee Hee-Soo Heo Soyeon Choe Chiheon Ham Sung-Ye Jung Bong-Jin Lee Icksang Han 67 437 0 26 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers Eliya Nachmani Yossi Adi Lior Wolf 61 175 0 29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering Neil Zeghidour David Grangier VLM 90 264 0 20 Feb 2020
An empirical study of Conv-TasNet Berkan Kadıoğlu Michael Horgan Xiaoyu Liu Jordi Pons Dan Darcy Vivek Kumar 40 44 0 20 Feb 2020
Continuous speech separation: dataset and analysis Zhuo Chen Takuya Yoshioka Liang Lu Tianyan Zhou Zhong Meng Yi Luo Jian Wu Xiong Xiao Jinyu Li 71 216 0 30 Jan 2020
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation Yi Luo Zhuo Chen Takuya Yoshioka AI4TS 86 771 0 14 Oct 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech T. Menne Ilya Sklyar Ralf Schluter Hermann Ney 145 35 0 09 May 2019
SDR - half-baked or well done? F. Sánchez-Martínez M. Esplà-Gomis Hakan Erdogan J. Hershey 153 1,202 0 06 Nov 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking Quan Wang Hannah Muckenhirn K. Wilson Prashant Sridhar Zelin Wu J. Hershey Rif A. Saurous Ron J. Weiss Ye Jia Ignacio López Moreno 71 370 0 11 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks Takuya Yoshioka Hakan Erdogan Zhuo Chen Xiong Xiao F. Alleva BDL 55 82 0 08 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Yi Luo N. Mesgarani 159 1,794 0 20 Sep 2018
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines Jon Barker Shinji Watanabe Emmanuel Vincent J. Trmal 59 685 0 28 Mar 2018
Speaker Diarization with LSTM Quan Wang Carlton Downey Li Wan Philip Mansfield Ignacio López Moreno 65 319 0 28 Oct 2017
Supervised Speech Separation Based on Deep Learning: An Overview DeLiang Wang Jitong Chen SSL 77 1,374 0 24 Aug 2017
Speaker-independent Speech Separation with Deep Attractor Network Yi Luo Zhuo Chen N. Mesgarani 56 247 0 12 Jul 2017
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks Morten Kolbaek Dong Yu Zheng-Hua Tan Jesper Jensen 57 726 0 18 Mar 2017
English Conversational Telephone Speech Recognition by Humans and Machines G. Saon Gakuto Kurata Tom Sercu Kartik Audhkhasi Samuel Thomas ... Bhuvana Ramabhadran M. Picheny L. Lim Bergul Roomi Phil Hall 65 365 0 06 Mar 2017
Achieving Human Parity in Conversational Speech Recognition Wayne Xiong J. Droppo Xuedong Huang Frank Seide M. Seltzer A. Stolcke Dong Yu Geoffrey Zweig 89 581 0 17 Oct 2016
Deep clustering: Discriminative embeddings for segmentation and separation J. Hershey Zhuo Chen Jonathan Le Roux Shinji Watanabe 62 1,319 0 18 Aug 2015