Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks

18 March 2017

Papers citing "Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks"

13 / 113 papers shown

Title
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech | T. Menne, Ilya Sklyar, Ralf Schluter, Hermann Ney | 19 | 35 | 0 | 09 May 2019
Universal Sound Separation | Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, K. Wilson, Jonathan Le Roux, J. Hershey | 11 | 184 | 0 | 08 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation | Yuzhou Liu, DeLiang Wang | 27 | 157 | 0 | 25 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering | Gene-Ping Yang, Chao-I Tuan, Hung-yi Lee, Lin-Shan Lee | 20 | 25 | 0 | 16 Apr 2019
Time Domain Audio Visual Speech Separation | Jian Wu, Yong-mei Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu | 20 | 114 | 0 | 07 Apr 2019
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss | Chenglin Xu, Wei Rao, Chng Eng Siong, Haizhou Li | 34 | 53 | 0 | 24 Mar 2019
FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation | Ziqiang Shi, Huibin Lin, L. Liu, Rujie Liu, Shoji Hayakawa, Shouji Harada, Jiqing Han | 17 | 22 | 0 | 02 Feb 2019
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective | Zhong-Qiu Wang, Ke Tan, DeLiang Wang | 50 | 95 | 0 | 22 Nov 2018
Trainable Adaptive Window Switching for Speech Enhancement | Yuma Koizumi, N. Harada, Y. Haneda | 16 | 8 | 0 | 05 Nov 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks | Takuya Yoshioka, Hakan Erdogan, Zhuo Chen, Xiong Xiao, F. Alleva | BDL | 22 | 81 | 0 | 08 Oct 2018
Phasebook and Friends: Leveraging Discrete Representations for Source Separation | Jonathan Le Roux, G. Wichern, Shinji Watanabe, Andy M. Sarroff, J. Hershey | 16 | 76 | 0 | 02 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation | Yi Luo, N. Mesgarani | 19 | 1,748 | 0 | 20 Sep 2018
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks | Szu-Wei Fu, Tao-Wei Wang, Yu Tsao, Xugang Lu, Hisashi Kawai | 22 | 271 | 0 | 12 Sep 2017