Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks

18 March 2017

Papers citing "Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks"

50 / 107 papers shown

Title
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers Thilo von Neumann K. Kinoshita Christoph Boeddeker Marc Delcroix Reinhold Haeb-Umbach 16 23 0 30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation Thilo von Neumann Christoph Boeddeker K. Kinoshita Marc Delcroix Reinhold Haeb-Umbach 16 6 0 30 Jul 2021
Multi-Task Audio Source Separation Lu Zhang Chenxing Li Feng Deng Xiaorui Wang 41 8 0 14 Jul 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication Yuanyuan Bao Yanze Xu Na Xu Wenjing Yang Hongfeng Li Shicong Li Y. Jia Fei Xiang Jincheng He Ming Li 24 1 0 05 Jun 2021
A Database for Research on Detection and Enhancement of Speech Transmitted over HF links Jens Heitkaemper Joerg Schmalenstroeer Joerg Ullmann Valentin Ion Reinhold Haeb-Umbach 16 3 0 04 Jun 2021
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech Chenglin Xu Wei Rao Jibin Wu Haizhou Li 30 32 0 30 Mar 2021
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss Naoki Makishima Mana Ihori Akihiko Takashima Tomohiro Tanaka Shota Orihashi Ryo Masumura 22 8 0 02 Mar 2021
TransMask: A Compact and Fast Speech Separation Model Based on Transformer Zining Zhang Bingsheng He Zhenjie Zhang 21 21 0 19 Feb 2021
Group Communication with Context Codec for Lightweight Source Separation Yi Luo Cong Han N. Mesgarani 23 20 0 14 Dec 2020
Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation Ziye Yang Shanzheng Guan Xiao-Lei Zhang 14 14 0 01 Dec 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments Jisi Zhang Catalin Zorila R. Doddipatla Jon Barker 9 46 0 11 Nov 2020
Surrogate Source Model Learning for Determined Source Separation Robin Scheibler M. Togami 20 22 0 11 Nov 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings Shlomo E. Chazan Lior Wolf Eliya Nachmani Yossi Adi 19 29 0 04 Nov 2020
Recent Developments on ESPnet Toolkit Boosted by Conformer Pengcheng Guo Florian Boyer Xuankai Chang Tomoki Hayashi Yosuke Higuchi ... Jing Shi Shinji Watanabe Kun Wei Wangyou Zhang Yuekai Zhang 34 262 0 26 Oct 2020
Attention is All You Need in Speech Separation Cem Subakan Mirco Ravanelli Samuele Cornell Mirko Bronzi Jianyuan Zhong 27 536 0 25 Oct 2020
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection Yin Cao Turab Iqbal Qiuqiang Kong Y. Zhong Wenwu Wang Mark D. Plumbley 16 75 0 25 Oct 2020
Speaker Separation Using Speaker Inventories and Estimated Speech Peidong Wang Zhuo Chen DeLiang Wang Jinyu Li Jiawei Liu 30 11 0 20 Oct 2020
Attention-based scaling adaptation for target speech extraction Jiangyu Han Wei Rao Yanhua Long Jiaen Liang 11 9 0 19 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation Zhong-Qiu Wang Peidong Wang DeLiang Wang 17 88 0 04 Oct 2020
An End-to-end Architecture of Online Multi-channel Speech Separation Jian Wu Zhuo Chen Jinyu Li Takuya Yoshioka Zhili Tan Ed Lin Yi Luo Lei Xie 3DV 11 20 0 07 Sep 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment Hangting Chen Pengyuan Zhang 6 6 0 01 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction Jing Shi Jiaming Xu Yusuke Fujita Shinji Watanabe Bo Xu BDL 41 20 0 25 Jun 2020
Listen to What You Want: Neural Network-based Universal Sound Selector Tsubasa Ochiai Marc Delcroix Yuma Koizumi Hiroaki Ito K. Kinoshita S. Araki 8 61 0 10 Jun 2020
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation Yuichiro Koyama Oluwafemi Azeez Bhiksha Raj 22 4 0 23 May 2020
Jointly optimal denoising, dereverberation, and source separation Tomohiro Nakatani Christoph Boeddeker K. Kinoshita Rintaro Ikeshita Marc Delcroix Reinhold Haeb-Umbach 16 46 0 20 May 2020
Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression Nils L. Westhausen B. Meyer 17 99 0 15 May 2020
FaceFilter: Audio-visual speech separation using still images Soo-Whan Chung Soyeon Choe Joon Son Chung Hong-Goo Kang CVBM 21 66 0 14 May 2020
SpEx+: A Complete Time Domain Speaker Extraction Network Meng Ge Chenglin Xu Longbiao Wang Chng Eng Siong J. Dang Haizhou Li 21 141 0 10 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 21 151 0 08 May 2020
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss Yi Luo N. Mesgarani 16 29 0 27 Mar 2020
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network K. Kinoshita Tsubasa Ochiai Marc Delcroix Tomohiro Nakatani 21 97 0 09 Mar 2020
Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system K. Kinoshita Marc Delcroix S. Araki Tomohiro Nakatani 194 30 0 09 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers Eliya Nachmani Yossi Adi Lior Wolf 20 175 0 29 Feb 2020
End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification Yusuke Fujita Shinji Watanabe Shota Horiguchi Yawen Xue Kenji Nagamatsu 12 49 0 24 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering Neil Zeghidour David Grangier VLM 27 261 0 20 Feb 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention Yuma Koizumi Kohei Yatabe Marc Delcroix Yoshiki Masuyama Daiki Takeuchi 12 125 0 14 Feb 2020
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter Optimization Jeroen Zegers Hugo Van hamme BDL 26 7 0 19 Dec 2019
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation Yi Luo Zhuo Chen N. Mesgarani Takuya Yoshioka 11 178 0 30 Oct 2019
Mixup-breakdown: a consistency training method for improving generalization of speech separation models Max W. Y. Lam Jun Wang Dan Su Dong Yu 33 22 0 28 Oct 2019
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet David Ditter Timo Gerkmann 9 57 0 25 Oct 2019
Filterbank design for end-to-end speech separation Manuel Pariente Samuele Cornell Antoine Deleforge Emmanuel Vincent 18 69 0 23 Oct 2019
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation Matthew Maciejewski G. Wichern E. McQuinn Jonathan Le Roux 6 179 0 22 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement Morten Kolbæk Zheng-Hua Tan S. H. Jensen Jesper Jensen AAML 60 125 0 03 Sep 2019
A comprehensive study of speech separation: spectrogram vs waveform separation F. Bahmaninezhad Jian Wu Rongzhi Gu Shi-Xiong Zhang Yong-mei Xu Meng Yu Dong Yu 34 80 0 17 May 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech T. Menne Ilya Sklyar Ralf Schluter Hermann Ney 14 35 0 09 May 2019
Universal Sound Separation Ilya Kavalerov Scott Wisdom Hakan Erdogan Brian Patton K. Wilson Jonathan Le Roux J. Hershey 11 184 0 08 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation Yuzhou Liu DeLiang Wang 27 157 0 25 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering Gene-Ping Yang Chao-I Tuan Hung-yi Lee Lin-Shan Lee 20 25 0 16 Apr 2019
Time Domain Audio Visual Speech Separation Jian Wu Yong-mei Xu Shi-Xiong Zhang Lianwu Chen Meng Yu Lei Xie Dong Yu 20 114 0 07 Apr 2019
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss Chenglin Xu Wei Rao Chng Eng Siong Haizhou Li 29 53 0 24 Mar 2019