ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.07524
  4. Cited By
Supervised Speech Separation Based on Deep Learning: An Overview

Supervised Speech Separation Based on Deep Learning: An Overview

24 August 2017
DeLiang Wang
Jitong Chen
    SSL
ArXivPDFHTML

Papers citing "Supervised Speech Separation Based on Deep Learning: An Overview"

50 / 219 papers shown
Title
Partially Adaptive Multichannel Joint Reduction of Ego-noise and
  Environmental Noise
Partially Adaptive Multichannel Joint Reduction of Ego-noise and Environmental Noise
Hu Fang
Niklas Wittmer
Johannes Twiefel
S. Wermter
Timo Gerkmann
33
3
0
27 Mar 2023
ICASSP 2023 Deep Noise Suppression Challenge
ICASSP 2023 Deep Noise Suppression Challenge
Harishchandra Dubey
A. Aazami
Vishak Gopal
Sergiy Matusevych
Sebastian Braun
...
Sefik Emre Eskimez
Manthan Thakker
H. Gamper
Takuya Yoshioka
R. Aichner
28
83
0
21 Mar 2023
Towards Real-Time Single-Channel Speech Separation in Noisy and
  Reverberant Environments
Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments
Julian Neri
Sebastian Braun
22
1
0
14 Mar 2023
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised
  Speech and Text Representations
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
Yuma Koizumi
Heiga Zen
Shigeki Karita
Yifan Ding
Kohei Yatabe
Nobuyuki Morioka
Yu Zhang
Wei Han
Ankur Bapna
M. Bacchiani
39
24
0
03 Mar 2023
Extending DNN-based Multiplicative Masking to Deep Subband Filtering for
  Improved Dereverberation
Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Jean-Marie Lemercier
Julian Tobergte
Timo Gerkmann
24
2
0
01 Mar 2023
Reducing the Prior Mismatch of Stochastic Differential Equations for
  Diffusion-based Speech Enhancement
Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement
Bunlong Lay
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
26
24
0
28 Feb 2023
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Chen Chen
Yuchen Hu
Weiwei Weng
Chng Eng Siong
DiffM
48
19
0
23 Feb 2023
Unsupervised Noise adaptation using Data Simulation
Unsupervised Noise adaptation using Data Simulation
Chen Chen
Yuchen Hu
Heqing Zou
Linhui Sun
Chng Eng Siong
36
13
0
23 Feb 2023
A DNN based Normalized Time-frequency Weighted Criterion for Robust
  Wideband DoA Estimation
A DNN based Normalized Time-frequency Weighted Criterion for Robust Wideband DoA Estimation
Kuan-Lin Chen
Ching-Hua Lee
Bhaskar D. Rao
H. Garudadri
18
4
0
20 Feb 2023
Local spectral attention for full-band speech enhancement
Local spectral attention for full-band speech enhancement
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
46
0
0
11 Feb 2023
Neural Target Speech Extraction: An Overview
Neural Target Speech Extraction: An Overview
Kateřina Žmolíková
Marc Delcroix
Tsubasa Ochiai
K. Kinoshita
JanHonza'' vCernocký
Dong Yu
25
86
0
31 Jan 2023
Unearthing InSights into Mars: Unsupervised Source Separation with
  Limited Data
Unearthing InSights into Mars: Unsupervised Source Separation with Limited Data
Ali Siahkoohi
Rudy Morel
Maarten V. de Hoop
Erwan Allys
G. Sainton
Taichi Kawamura
47
4
0
27 Jan 2023
On Batching Variable Size Inputs for Training End-to-End Speech
  Enhancement Systems
On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems
Philippe Gonzalez
T. S. Alstrøm
Tobias May
24
9
0
25 Jan 2023
HEAR4Health: A blueprint for making computer audition a staple of modern
  healthcare
HEAR4Health: A blueprint for making computer audition a staple of modern healthcare
Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
...
Shahin Amiriparian
K. D. Bartl-Pokorny
A. Batliner
Florian B. Pokorny
Björn W. Schuller
49
7
0
25 Jan 2023
Multi-resolution location-based training for multi-channel continuous
  speech separation
Multi-resolution location-based training for multi-channel continuous speech separation
H. Taherian
DeLiang Wang
43
7
0
16 Jan 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech
  Enhancement and Dereverberation
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
159
82
0
22 Dec 2022
Towards Unified All-Neural Beamforming for Time and Frequency Domain
  Speech Separation
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
AI4TS
35
24
0
16 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech
  Enhancement
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
32
25
0
15 Dec 2022
Tackling the Cocktail Fork Problem for Separation and Transcription of
  Real-World Soundtracks
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Zhong-Qiu Wang
Jonathan Le Roux
29
10
0
14 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single
  Channel Speech Separation
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
35
0
0
14 Dec 2022
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled
  Videos
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Hao-Wen Dong
Naoya Takahashi
Yuki Mitsufuji
Julian McAuley
Taylor Berg-Kirkpatrick
VLM
CLIP
31
25
0
14 Dec 2022
Uncertainty Estimation in Deep Speech Enhancement Using Complex Gaussian
  Mixture Models
Uncertainty Estimation in Deep Speech Enhancement Using Complex Gaussian Mixture Models
Hu Fang
Timo Gerkmann
UQCV
16
3
0
09 Dec 2022
Injecting Spatial Information for Monaural Speech Enhancement via
  Knowledge Distillation
Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation
Xinmeng Xu
Weiping Tu
Yuhong Yang
24
0
0
02 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
47
21
0
01 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech
  Separation
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
48
121
0
22 Nov 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method
  Using Variational Autoencoder and Adversarial Training
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
27
5
0
16 Nov 2022
McNet: Fuse Multiple Cues for Multichannel Speech Enhancement
McNet: Fuse Multiple Cues for Multichannel Speech Enhancement
Yujie Yang
Changsheng Quan
Xiaofei Li
38
21
0
16 Nov 2022
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral
  Mapping for Single-channel Speech Enhancement
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement
Kuan-Lin Chen
Daniel D. E. Wong
Ke Tan
Buye Xu
Anurag Kumar
V. Ithapu
37
1
0
16 Nov 2022
The Potential of Neural Speech Synthesis-based Data Augmentation for
  Personalized Speech Enhancement
The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Anastasia Kuznetsova
Aswin Sivaraman
Minje Kim
32
3
0
14 Nov 2022
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of
  Experts
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts
Xiaofei Wang
Zhuo Chen
Yu Shi
Jian Wu
Naoyuki Kanda
Takuya Yoshioka
MoE
28
1
0
11 Nov 2022
Analysing Diffusion-based Generative Approaches versus Discriminative
  Approaches for Speech Restoration
Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
38
35
0
04 Nov 2022
Fast and efficient speech enhancement with variational autoencoders
Fast and efficient speech enhancement with variational autoencoders
M. Sadeghi
Romain Serizel
DRL
BDL
27
2
0
02 Nov 2022
A weighted-variance variational autoencoder model for speech enhancement
A weighted-variance variational autoencoder model for speech enhancement
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
35
1
0
02 Nov 2022
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Shulin He
Wei Rao
Jinjiang Liu
Jun Chen
Yukai Ju
Xueliang Zhang
Yannan Wang
Shidong Shang
26
6
0
28 Oct 2022
Audio Signal Enhancement with Learning from Positive and Unlabelled Data
Audio Signal Enhancement with Learning from Positive and Unlabelled Data
N. Ito
Masashi Sugiyama
21
7
0
27 Oct 2022
Time-Domain Speech Enhancement for Robust Automatic Speech Recognition
Time-Domain Speech Enhancement for Robust Automatic Speech Recognition
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
28
8
0
24 Oct 2022
Improved Normalizing Flow-Based Speech Enhancement using an All-pole
  Gammatone Filterbank for Conditional Input Representation
Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation
Martin Strauss
Matteo Torcoli
B. Edler
31
4
0
21 Oct 2022
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with
  Large Ad-hoc Microphone Arrays
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays
Shupei Liu
Linfeng Feng
Yijun Gong
Chengdong Liang
Chen Zhang
Xiao-Lei Zhang
Xuelong Li
23
3
0
19 Oct 2022
Streaming Target-Speaker ASR with Neural Transducer
Streaming Target-Speaker ASR with Neural Transducer
Takafumi Moriya
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
T. Shinozaki
39
21
0
09 Sep 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural
  Speaker Separation
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
76
99
0
08 Sep 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative
  Models
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
24
185
0
11 Aug 2022
Inference skipping for more efficient real-time speech enhancement with
  parallel RNNs
Inference skipping for more efficient real-time speech enhancement with parallel RNNs
Xiaohuai Le
Tong Lei
Kai-Jyun Chen
Jing Lu
40
20
0
22 Jul 2022
A light-weight full-band speech enhancement model
A light-weight full-band speech enhancement model
Qi Hu
Zhongshu Hou
Xiaohuai Le
Jing Lu
28
3
0
29 Jun 2022
A two-stage full-band speech enhancement model with effective spectral
  compression mapping
A two-stage full-band speech enhancement model with effective spectral compression mapping
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
39
0
0
27 Jun 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time
  Dereverberation Targets
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
J. Valin
Ritwik Giri
Shrikant Venkataramani
Umut Isik
A. Krishnaswamy
13
2
0
16 Jun 2022
A deep representation learning speech enhancement method using
  $β$-VAE
A deep representation learning speech enhancement method using βββ-VAE
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
27
2
0
11 May 2022
Does a PESQNet (Loss) Require a Clean Reference Input? The Original PESQ
  Does, But ACR Listening Tests Don't
Does a PESQNet (Loss) Require a Clean Reference Input? The Original PESQ Does, But ACR Listening Tests Don't
Ziyi Xu
Maximilian Strake
Tim Fingscheidt
27
3
0
04 May 2022
On monoaural speech enhancement for automatic recognition of real noisy
  speech using mixture invariant training
On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
30
4
0
03 May 2022
Heterogeneous Separation Consistency Training for Adaptation of
  Unsupervised Speech Separation
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Jiangyu Han
Yanhua Long
35
6
0
23 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation
  System
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
27
20
0
14 Apr 2022
Previous
12345
Next