ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.13934
  4. Cited By
SMS-WSJ: Database, performance measures, and baseline recipe for
  multi-channel source separation and recognition

SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition

30 October 2019
Lukas Drude
Jens Heitkaemper
Christoph Boeddeker
Reinhold Haeb-Umbach
ArXivPDFHTML

Papers citing "SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition"

39 / 39 papers shown
Title
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior
Zhongweiyang Xu
Xulin Fan
Zhong-Qiu Wang
Xilin Jiang
Romit Roy Choudhury
DiffM
54
0
0
08 May 2025
Elevating Robust Multi-Talker ASR by Decoupling Speaker Separation and Speech Recognition
Elevating Robust Multi-Talker ASR by Decoupling Speaker Separation and Speech Recognition
Yufeng Yang
H. Taherian
Vahid Ahmadi Kalkhorani
DeLiang Wang
44
0
0
23 Mar 2025
30+ Years of Source Separation Research: Achievements and Future Challenges
30+ Years of Source Separation Research: Achievements and Future Challenges
S. Araki
N. Ito
Reinhold Haeb-Umbach
Gordon Wichern
Zhong-Qiu Wang
Yuki Mitsufuji
AI4TS
44
0
0
21 Jan 2025
Improving Generalization of Speech Separation in Real-World Scenarios:
  Strategies in Simulation, Optimization, and Evaluation
Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
K. Chen
Jiaqi Su
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Zeyu Jin
45
1
0
28 Aug 2024
Enhanced Reverberation as Supervision for Unsupervised Speech Separation
Enhanced Reverberation as Supervision for Unsupervised Speech Separation
Kohei Saijo
Gordon Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
39
1
0
06 Aug 2024
Cross-Talk Reduction
Cross-Talk Reduction
Zhong-Qiu Wang
Anurag Kumar
Shinji Watanabe
34
2
0
30 May 2024
CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional
  Encoding for Single- and Multi-Channel Speaker Separation
CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional Encoding for Single- and Multi-Channel Speaker Separation
Vahid Ahmadi Kalkhorani
DeLiang Wang
46
3
0
06 Mar 2024
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
Peter Vieting
Simon Berger
Thilo von Neumann
Christoph Boeddeker
Ralf Schluter
Reinhold Haeb-Umbach
26
0
0
15 Sep 2023
Recovering from Privacy-Preserving Masking with Large Language Models
Recovering from Privacy-Preserving Masking with Large Language Models
A. Vats
Zhe Liu
Peng Su
Debjyoti Paul
Yingyi Ma
Yutong Pang
Zeeshan Ahmed
Ozlem Kalinli
31
9
0
12 Sep 2023
Remixing-based Unsupervised Source Separation from Scratch
Remixing-based Unsupervised Source Separation from Scratch
Kohei Saijo
Tetsuji Ogawa
16
3
0
01 Sep 2023
SpatialNet: Extensively Learning Spatial Information for Multichannel
  Joint Speech Separation, Denoising and Dereverberation
SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
Changsheng Quan
Xiaofei Li
18
36
0
31 Jul 2023
Mixture Encoder for Joint Speech Separation and Recognition
Mixture Encoder for Joint Speech Separation and Recognition
Simon Berger
Peter Vieting
Christoph Boeddeker
Ralf Schluter
Reinhold Häb-Umbach
26
6
0
21 Jun 2023
UNSSOR: Unsupervised Neural Speech Separation by Leveraging
  Over-determined Training Mixtures
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Zhong-Qiu Wang
Shinji Watanabe
38
10
0
31 May 2023
Svarah: Evaluating English ASR Systems on Indian Accents
Svarah: Evaluating English ASR Systems on Indian Accents
Tahir Javed
Sakshi Joshi
Vignesh Nagarajan
Sairam Sundaresan
J. Nawale
A. Raman
Kaushal Bhogale
Pratyush Kumar
Mitesh M. Khapra
22
8
0
25 May 2023
Tackling the Cocktail Fork Problem for Separation and Transcription of
  Real-World Soundtracks
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Zhong-Qiu Wang
Jonathan Le Roux
27
10
0
14 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech
  Separation
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
43
121
0
22 Nov 2022
Self-Remixing: Unsupervised Speech Separation via Separation and
  Remixing
Self-Remixing: Unsupervised Speech Separation via Separation and Remixing
Kohei Saijo
Tetsuji Ogawa
SSL
22
11
0
18 Nov 2022
PodcastMix: A dataset for separating music and speech in podcasts
PodcastMix: A dataset for separating music and speech in podcasts
Nico M. Schmidt
Jordi Pons
M. Miron
25
2
0
15 Jul 2022
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
Tobias Gburrek
Christoph Boeddeker
Thilo von Neumann
Tobias Cord-Landwehr
Joerg Schmalenstroeer
Reinhold Haeb-Umbach
11
5
0
02 May 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Tianyi Zhou
35
31
0
04 Apr 2022
An Initialization Scheme for Meeting Separation with Spatial Mixture
  Models
An Initialization Scheme for Meeting Separation with Spatial Mixture Models
Christoph Boeddeker
Tobias Cord-Landwehr
Thilo von Neumann
Reinhold Haeb-Umbach
30
10
0
04 Apr 2022
Monaural source separation: From anechoic to reverberant environments
Monaural source separation: From anechoic to reverberant environments
Tobias Cord-Landwehr
Christoph Boeddeker
Thilo von Neumann
Catalin Zorila
R. Doddipatla
Reinhold Haeb-Umbach
24
31
0
15 Nov 2021
LiMuSE: Lightweight Multi-modal Speaker Extraction
LiMuSE: Lightweight Multi-modal Speaker Extraction
Qinghua Liu
Yating Huang
Yunzhe Hao
Jiaming Xu
Bo Xu
43
6
0
07 Nov 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World
  Soundtracks
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks
Darius Petermann
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
23
37
0
19 Oct 2021
Convolutive Prediction for Monaural Speech Dereverberation and
  Noisy-Reverberant Speaker Separation
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
22
31
0
16 Aug 2021
Convolutive Prediction for Reverberant Speech Separation
Convolutive Prediction for Reverberant Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
28
12
0
16 Aug 2021
On The Compensation Between Magnitude and Phase in Speech Separation
On The Compensation Between Magnitude and Phase in Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
27
71
0
11 Aug 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech
  Separation
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
23
22
0
15 Jun 2021
A Comparison and Combination of Unsupervised Blind Source Separation
  Techniques
A Comparison and Combination of Unsupervised Blind Source Separation Techniques
Christoph Boeddeker
F. Rautenberg
Reinhold Haeb-Umbach
VLM
28
11
0
10 Jun 2021
A Database for Research on Detection and Enhancement of Speech
  Transmitted over HF links
A Database for Research on Detection and Enhancement of Speech Transmitted over HF links
Jens Heitkaemper
Joerg Schmalenstroeer
Joerg Ullmann
Valentin Ion
Reinhold Haeb-Umbach
21
3
0
04 Jun 2021
Deep Learning based Multi-Source Localization with Source Splitting and
  its Effectiveness in Multi-Talker Speech Recognition
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
34
78
0
16 Feb 2021
Convolutive Transfer Function Invariant SDR training criteria for
  Multi-Channel Reverberant Speech Separation
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation
Christoph Boeddeker
Wangyou Zhang
Tomohiro Nakatani
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Naoyuki Kamo
Y. Qian
Reinhold Haeb-Umbach
13
29
0
30 Nov 2020
Rethinking the Separation Layers in Speech Separation Networks
Rethinking the Separation Layers in Speech Separation Networks
Yi Luo
Zhuo Chen
Cong Han
Chenda Li
Tianyan Zhou
N. Mesgarani
19
10
0
17 Nov 2020
ESPnet-se: end-to-end speech enhancement and separation toolkit designed
  for asr integration
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Chenda Li
Jing Shi
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
...
Moto Hira
Tomoki Hayashi
Christoph Boeddeker
Zhuo Chen
Shinji Watanabe
VLM
39
81
0
07 Nov 2020
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition
  with Source Localization
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
15
20
0
30 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and
  Continuous Speech Separation
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
33
88
0
04 Oct 2020
Exploring the time-domain deep attractor network with two-stream
  architectures in a reverberant environment
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
11
6
0
01 Jul 2020
Asteroid: the PyTorch-based audio source separation toolkit for
  researchers
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
27
151
0
08 May 2020
Demystifying TasNet: A Dissecting Approach
Demystifying TasNet: A Dissecting Approach
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb-Umbach
25
58
0
20 Nov 2019
1