Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.07524
Cited By
Supervised Speech Separation Based on Deep Learning: An Overview
24 August 2017
DeLiang Wang
Jitong Chen
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Supervised Speech Separation Based on Deep Learning: An Overview"
50 / 219 papers shown
Title
Heterogeneous Target Speech Separation
Hyunjae Cho
Wonbin Jung
Junhyeok Lee
Paris Smaragdis
Sanghyun Woo
51
26
0
07 Apr 2022
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Zifeng Zhao
Dongchao Yang
Rongzhi Gu
Haoran Zhang
Yuexian Zou
33
16
0
04 Apr 2022
Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Zhenhao Jin
Xiang Hao
Xiangdong Su
37
4
0
30 Mar 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
26
94
0
28 Mar 2022
Separate What You Describe: Language-Queried Audio Source Separation
Xubo Liu
Haohe Liu
Qiuqiang Kong
Xinhao Mei
Jinzheng Zhao
Qiushi Huang
Mark D. Plumbley
Wenwu Wang
44
58
0
28 Mar 2022
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement
Jun Chen
Zehao Wang
Deyi Tuo
Zhiyong Wu
Shiyin Kang
Helen Meng
34
107
0
23 Mar 2022
Automated detection of foreground speech with wearable sensing in everyday home environments: A transfer learning approach
Dawei Liang
Zifan Xu
Yinuo Chen
Rebecca Adaimi
David Harwath
Edison Thomaz
48
1
0
21 Mar 2022
MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Andong Li
C. Zheng
Ziyang Zhang
Xiaodong Li
32
3
0
14 Mar 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai
Heng-Jui Chang
Wen-Chin Huang
Zili Huang
Kushal Lakhotia
...
Hsuan-Jui Chen
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
31
110
0
14 Mar 2022
Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet
Björn W. Schuller
Ali Akman
Yi-Fen Chang
H. Coppock
Alexander Gebhard
Alexander Kathan
Esther Rituerto-González
Andreas Triantafyllopoulos
Florian B. Pokorny
38
1
0
10 Mar 2022
CNN self-attention voice activity detector
Amit Sofer
Shlomo E. Chazan
21
8
0
06 Mar 2022
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement
Hu Fang
Tal Peer
S. Wermter
Timo Gerkmann
39
6
0
04 Mar 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
45
106
0
02 Mar 2022
DMF-Net: A decoupling-style multi-band fusion model for full-band speech enhancement
Guochen Yu
Yuansheng Guan
Weixin Meng
C. Zheng
Haibo Wang
42
2
0
01 Mar 2022
L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment
E. Guizzo
Christian Marinoni
Marco Pennese
Xinlei Ren
Xiguang Zheng
Chen Zhang
Bruno Masiero
A. Uncini
Danilo Comminiello
24
52
0
21 Feb 2022
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
28
25
0
11 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
27
9
0
10 Dec 2021
Effect of noise suppression losses on speech distortion and ASR performance
Sebastian Braun
H. Gamper
22
19
0
23 Nov 2021
Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
V. Trinh
Sebastian Braun
33
17
0
16 Nov 2021
Single-channel speech separation using Soft-minimum Permutation Invariant Training
Midia Yousefi
John H. L. Hansen
21
3
0
16 Nov 2021
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Ladislav Mošner
Oldrich Plchot
L. Burget
J. Černocký
39
7
0
11 Nov 2021
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification
J. Málek
Jakub Janský
Zbyněk Koldovský
Tomás Kounovský
Jaroslav Cmejla
J. Zdánský
30
10
0
05 Nov 2021
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Heming Wang
Yao Qian
Xiaofei Wang
Yiming Wang
Chengyi Wang
Shujie Liu
Takuya Yoshioka
Jinyu Li
DeLiang Wang
26
29
0
28 Oct 2021
Multichannel Speech Enhancement without Beamforming
Asutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
31
13
0
25 Oct 2021
Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
38
4
0
22 Oct 2021
Objective Measures of Perceptual Audio Quality Reviewed: An Evaluation of Their Application Domain Dependence
Matteo Torcoli
T. Kastner
Jürgen Herre
21
58
0
21 Oct 2021
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
KELM
27
40
0
20 Oct 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks
Darius Petermann
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
23
37
0
19 Oct 2021
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Guochen Yu
Andong Li
C. Zheng
Yinuo Guo
Yutian Wang
Hui Wang
40
84
0
13 Oct 2021
Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training
Changsheng Quan
Xiaofei Li
34
14
0
12 Oct 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
45
46
0
12 Oct 2021
Location-based training for multi-channel talker-independent speaker separation
H. Taherian
Ke Tan
DeLiang Wang
31
10
0
08 Oct 2021
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
Guochen Yu
Andong Li
Yutian Wang
Yinuo Guo
Hui Wang
C. Zheng
52
4
0
26 Sep 2021
Visual Scene Graphs for Audio Source Separation
Moitreya Chatterjee
Jonathan Le Roux
Narendra Ahuja
A. Cherian
31
36
0
24 Sep 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
Qiuqiang Kong
Yin Cao
Haohe Liu
Keunwoo Choi
Yuxuan Wang
118
96
0
12 Sep 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection
Siqi Zheng
Shiliang Zhang
Weilong Huang
Qian Chen
Hongbin Suo
Ming Lei
Jinwei Feng
Zhijie Yan
37
7
0
09 Sep 2021
A Two-stage Complex Network using Cycle-consistent Generative Adversarial Networks for Speech Enhancement
Guochen Yu
Yutian Wang
Hui Wang
Qin Zhang
C. Zheng
GAN
50
18
0
05 Sep 2021
SEC4SR: A Security Analysis Platform for Speaker Recognition
Guangke Chen
Zhe Zhao
Fu Song
Sen Chen
Lingling Fan
Yang Liu
AAML
35
12
0
04 Sep 2021
Dereverberation of Autoregressive Envelopes for Far-field Speech Recognition
Anurenjan Purushothaman
Anirudh Sreeram
Rohit Kumar
Sriram Ganapathy
34
7
0
12 Aug 2021
Microphone Array Generalization for Multichannel Narrowband Deep Speech Enhancement
Siyuan Zhang
Xiaofei Li
29
16
0
27 Jul 2021
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
41
8
0
14 Jul 2021
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Xiaohuai Le
Hongsheng Chen
Kai-Jyun Chen
Jing Lu
35
78
0
12 Jul 2021
Attention-based multi-channel speaker verification with ad-hoc microphone arrays
Che-Yuan Liang
Junqi Chen
Shanzheng Guan
Xiao-Lei Zhang
25
9
0
01 Jul 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
29
41
0
30 Jun 2021
SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing
R. Raj
Rohit Kumar
M. Jayesh
Anurenjan Purushothaman
Sriram Ganapathy
Basha Shaik
24
2
0
24 Jun 2021
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair
Shanshan Wang
Gaurav Naithani
Archontis Politis
Tuomas Virtanen
40
10
0
22 Jun 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
21
142
0
22 Jun 2021
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments
Yunzhe Hao
Jiaming Xu
Peng Zhang
Bo Xu
19
17
0
13 Jun 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Takafumi Moriya
Naoyuki Kamo
33
23
0
02 Jun 2021
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Aswin Sivaraman
Minje Kim
23
9
0
08 May 2021
Previous
1
2
3
4
5
Next