SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition

30 October 2019

Papers citing "SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition"


Title
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior Zhongweiyang Xu Xulin Fan Zhong-Qiu Wang Xilin Jiang Romit Roy Choudhury DiffM 54 0 0 08 May 2025
Elevating Robust Multi-Talker ASR by Decoupling Speaker Separation and Speech Recognition Yufeng Yang H. Taherian Vahid Ahmadi Kalkhorani DeLiang Wang 44 0 0 23 Mar 2025
30+ Years of Source Separation Research: Achievements and Future Challenges S. Araki N. Ito Reinhold Haeb-Umbach Gordon Wichern Zhong-Qiu Wang Yuki Mitsufuji AI4TS 44 0 0 21 Jan 2025
Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation K. Chen Jiaqi Su Taylor Berg-Kirkpatrick Shlomo Dubnov Zeyu Jin 45 1 0 28 Aug 2024
Enhanced Reverberation as Supervision for Unsupervised Speech Separation Kohei Saijo Gordon Wichern François G. Germain Zexu Pan Jonathan Le Roux 39 1 0 06 Aug 2024
Cross-Talk Reduction Zhong-Qiu Wang Anurag Kumar Shinji Watanabe 34 2 0 30 May 2024
CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional Encoding for Single- and Multi-Channel Speaker Separation Vahid Ahmadi Kalkhorani DeLiang Wang 46 3 0 06 Mar 2024
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription Peter Vieting Simon Berger Thilo von Neumann Christoph Boeddeker Ralf Schlüter Reinhold Haeb-Umbach 26 0 0 15 Sep 2023
Recovering from Privacy-Preserving Masking with Large Language Models A. Vats Zhe Liu Peng Su Debjyoti Paul Yingyi Ma Yutong Pang Zeeshan Ahmed Ozlem Kalinli 31 9 0 12 Sep 2023
Remixing-based Unsupervised Source Separation from Scratch Kohei Saijo Tetsuji Ogawa 16 3 0 01 Sep 2023
SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation Changsheng Quan Xiaofei Li 18 36 0 31 Jul 2023
Mixture Encoder for Joint Speech Separation and Recognition Simon Berger Peter Vieting Christoph Boeddeker Ralf Schlüter Reinhold Haeb-Umbach 26 6 0 21 Jun 2023
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures Zhong-Qiu Wang Shinji Watanabe 38 10 0 31 May 2023
Svarah: Evaluating English ASR Systems on Indian Accents Tahir Javed Sakshi Joshi Vignesh Nagarajan Sairam Sundaresan J. Nawale A. Raman Kaushal Bhogale Pratyush Kumar Mitesh M. Khapra 22 8 0 25 May 2023
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks Darius Petermann Gordon Wichern Aswin Shanmugam Subramanian Zhong-Qiu Wang Jonathan Le Roux 27 10 0 14 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation Zhong-Qiu Wang Samuele Cornell Shukjae Choi Younglo Lee Byeonghak Kim Shinji Watanabe 43 121 0 22 Nov 2022
Self-Remixing: Unsupervised Speech Separation via Separation and Remixing Kohei Saijo Tetsuji Ogawa SSL 22 11 0 18 Nov 2022
PodcastMix: A dataset for separating music and speech in podcasts Nico M. Schmidt Jordi Pons M. Miron 25 2 0 15 Jul 2022
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network Tobias Gburrek Christoph Boeddeker Thilo von Neumann Tobias Cord-Landwehr Joerg Schmalenstroeer Reinhold Haeb-Umbach 11 5 0 02 May 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing Zhenyu Tang R. Aralikatti Anton Ratnarajah Tianyi Zhou 35 31 0 04 Apr 2022
An Initialization Scheme for Meeting Separation with Spatial Mixture Models Christoph Boeddeker Tobias Cord-Landwehr Thilo von Neumann Reinhold Haeb-Umbach 30 10 0 04 Apr 2022
Monaural source separation: From anechoic to reverberant environments Tobias Cord-Landwehr Christoph Boeddeker Thilo von Neumann Catalin Zorila R. Doddipatla Reinhold Haeb-Umbach 24 31 0 15 Nov 2021
LiMuSE: Lightweight Multi-modal Speaker Extraction Qinghua Liu Yating Huang Yunzhe Hao Jiaming Xu Bo Xu 43 6 0 07 Nov 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks Darius Petermann Gordon Wichern Zhong-Qiu Wang Jonathan Le Roux 23 37 0 19 Oct 2021
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation Zhong-Qiu Wang Gordon Wichern Jonathan Le Roux 22 31 0 16 Aug 2021
Convolutive Prediction for Reverberant Speech Separation Zhong-Qiu Wang Gordon Wichern Jonathan Le Roux 28 12 0 16 Aug 2021
On The Compensation Between Magnitude and Phase in Speech Separation Zhong-Qiu Wang Gordon Wichern Jonathan Le Roux 27 71 0 11 Aug 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation Jisi Zhang Catalin Zorila R. Doddipatla Jon Barker 23 22 0 15 Jun 2021
A Comparison and Combination of Unsupervised Blind Source Separation Techniques Christoph Boeddeker F. Rautenberg Reinhold Haeb-Umbach VLM 28 11 0 10 Jun 2021
A Database for Research on Detection and Enhancement of Speech Transmitted over HF links Jens Heitkaemper Joerg Schmalenstroeer Joerg Ullmann Valentin Ion Reinhold Haeb-Umbach 21 3 0 04 Jun 2021
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition Aswin Shanmugam Subramanian Chao Weng Shinji Watanabe Meng Yu Dong Yu 34 78 0 16 Feb 2021
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation Christoph Boeddeker Wangyou Zhang Tomohiro Nakatani K. Kinoshita Tsubasa Ochiai Marc Delcroix Naoyuki Kamo Y. Qian Reinhold Haeb-Umbach 13 29 0 30 Nov 2020
Rethinking the Separation Layers in Speech Separation Networks Yi Luo Zhuo Chen Cong Han Chenda Li Tianyan Zhou N. Mesgarani 19 10 0 17 Nov 2020
ESPnet-SE: End-to-End Speech Enhancement and Separation Toolkit Designed for ASR Integration Chenda Li Jing Shi Wangyou Zhang Aswin Shanmugam Subramanian Xuankai Chang ... Moto Hira Tomoki Hayashi Christoph Boeddeker Zhuo Chen Shinji Watanabe VLM 39 81 0 07 Nov 2020
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization Aswin Shanmugam Subramanian Chao Weng Shinji Watanabe Meng Yu Yong-mei Xu Shi-Xiong Zhang Dong Yu 15 20 0 30 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation Zhong-Qiu Wang Peidong Wang DeLiang Wang 33 88 0 04 Oct 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment Hangting Chen Pengyuan Zhang 11 6 0 01 Jul 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 27 151 0 08 May 2020
Demystifying TasNet: A Dissecting Approach Jens Heitkaemper Darius Jakobeit Christoph Boeddeker Lukas Drude Reinhold Haeb-Umbach 25 58 0 20 Nov 2019