2212.00369
v2 (latest)
Deep neural network techniques for monaural speech enhancement: state of the art analysis
1 December 2022
P. Ochieng
Papers citing "Deep neural network techniques for monaural speech enhancement: state of the art analysis"
50 / 125 papers shown
Title
Phase-aware Single-stage Speech Denoising and Dereverberation with U-Net
Hyeong-Seok Choi
Hoon Heo
Jie Hwan Lee
Kyogu Lee
91
19
0
01 Jun 2020
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
Igor Fedorov
Marko Stamenovic
Carl R. Jensen
Li-Chia Yang
Ari Mandell
Yiming Gan
Matthew Mattina
P. Whatmough
59
98
0
20 May 2020
Vector-Quantized Autoregressive Predictive Coding
Yu-An Chung
Hao Tang
James R. Glass
SSL
54
115
0
17 May 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
229
3,164
0
16 May 2020
Longformer: The Long-Document Transformer
Iz Beltagy
Matthew E. Peters
Arman Cohan
RALM
VLM
190
4,105
0
10 Apr 2020
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss
Yi Luo
N. Mesgarani
72
29
0
27 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
87
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
112
265
0
20 Feb 2020
End-to-End Multi-speaker Speech Recognition with Transformer
Xuankai Chang
Wangyou Zhang
Y. Qian
Jonathan Le Roux
Shinji Watanabe
ViT
93
106
0
10 Feb 2020
Improving GANs for Speech Enhancement
Huy P Phan
Ian Mcloughlin
L. D. Pham
Oliver Y. Chén
P. Koch
M. D. Vos
Alfred Mertins
65
119
0
15 Jan 2020
Reformer: The Efficient Transformer
Nikita Kitaev
Lukasz Kaiser
Anselm Levskaya
VLM
209
2,335
0
13 Jan 2020
Demystifying TasNet: A Dissecting Approach
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb-Umbach
50
58
0
20 Nov 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
155
374
0
25 Oct 2019
A Recurrent Variational Autoencoder for Speech Enhancement
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
DRL
138
79
0
24 Oct 2019
Two-Step Sound Source Separation: Training on Learned Latent Targets
Efthymios Tzinis
Shrikant Venkataramani
Zhepei Wang
Y. C. Sübakan
Paris Smaragdis
65
65
0
22 Oct 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
100
775
0
14 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
107
130
0
03 Sep 2019
Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques
Jyun-Yi Wu
Cheng Yu
Szu-Wei Fu
Chih-Ting Liu
Shao-Yi Chien
Yu Tsao
33
23
0
31 May 2019
A comprehensive study of speech separation: spectrogram vs waveform separation
F. Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
Dong Yu
73
81
0
17 May 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
70
331
0
13 May 2019
Universal Sound Separation
Ilya Kavalerov
Scott Wisdom
Hakan Erdogan
Brian Patton
K. Wilson
Jonathan Le Roux
J. Hershey
81
187
0
08 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation
Yuzhou Liu
DeLiang Wang
80
158
0
25 Apr 2019
Towards Generalized Speech Enhancement with Generative Adversarial Networks
Santiago Pascual
Joan Serrà
Antonio Bonafonte
GAN
64
33
0
06 Apr 2019
An Unsupervised Autoregressive Model for Speech Representation Learning
Yu-An Chung
Wei-Ning Hsu
Hao Tang
James R. Glass
SSL
94
409
0
05 Apr 2019
Recursive speech separation for unknown number of speakers
Naoya Takahashi
Sudarsanam Parthasaarathy
Nabarun Goswami
Yuki Mitsufuji
58
81
0
05 Apr 2019
Speech enhancement with variational autoencoders and alpha-stable distributions
Simon Leglaive
Umut Simsekli
Antoine Liutkus
Laurent Girin
Radu Horaud
DRL
55
36
0
08 Feb 2019
A variance modeling framework based on variational autoencoders for speech enhancement
Simon Leglaive
Laurent Girin
Radu Horaud
DRL
63
91
0
05 Feb 2019
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Zhong-Qiu Wang
Ke Tan
DeLiang Wang
107
95
0
22 Nov 2018
SDR - half-baked or well done?
Jonathan Le Roux
Scott Wisdom
Hakan Erdogan
J. Hershey
165
1,205
0
06 Nov 2018
End-to-end music source separation: is it possible in the waveform domain?
Francesc Lluís
Jordi Pons
Xavier Serra
74
73
0
29 Oct 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
96
370
0
11 Oct 2018
Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Jonathan Le Roux
Gordon Wichern
Shinji Watanabe
Andy M. Sarroff
J. Hershey
64
77
0
02 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
177
1,796
0
20 Sep 2018
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y. Hsu
Yu-Chen Lin
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
MQ
42
15
0
17 Aug 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
Jun Wang
Jie Chen
Dan Su
Lianwu Chen
Meng Yu
Y. Qian
Dong Yu
90
91
0
24 Jul 2018
Noise Adaptive Speech Enhancement using Domain Adversarial Training
Chien-Feng Liao
Yu Tsao
Hung-yi Lee
H. Wang
60
52
0
19 Jul 2018
Speech Denoising with Deep Feature Losses
François Germain
Qifeng Chen
V. Koltun
78
162
0
27 Jun 2018
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
Daniel Stoller
Sebastian Ewert
S. Dixon
AI4TS
146
598
0
08 Jun 2018
Noise2Noise: Learning Image Restoration without Clean Data
J. Lehtinen
Jacob Munkberg
J. Hasselgren
S. Laine
Tero Karras
M. Aittala
Timo Aila
109
1,610
0
12 Mar 2018
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
DRL
101
4,857
0
04 Mar 2018
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Yuhong Li
Xiaofan Zhang
Deming Chen
169
1,343
0
27 Feb 2018
Monaural Speech Enhancement using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure
Morten Kolbæk
Zheng-Hua Tan
Jesper Jensen
67
61
0
02 Feb 2018
Language and Noise Transfer in Speech Enhancement Generative Adversarial Network
Santiago Pascual
Maruchan Park
Joan Serrà
Antonio Bonafonte
K. Ahn
64
28
0
18 Dec 2017
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
Chris Donahue
Bo Li
Rohit Prabhavalkar
58
201
0
15 Nov 2017
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
94
633
0
01 Nov 2017
Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Yoshiaki Bando
Masato Mimura
Katsutoshi Itoyama
Kazuyoshi Yoshii
Tatsuya Kawahara
82
120
0
31 Oct 2017
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks
Szu-Wei Fu
Tao-Wei Wang
Yu Tsao
Xugang Lu
Hisashi Kawai
86
276
0
12 Sep 2017
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising
Hao Li
Xueliang Zhang
Hui Zhang
Guanglai Gao
18
7
0
28 Aug 2017
Supervised Speech Separation Based on Deep Learning: An Overview
DeLiang Wang
Jitong Chen
SSL
94
1,376
0
24 Aug 2017
Speaker-independent Speech Separation with Deep Attractor Network
Yi Luo
Zhuo Chen
N. Mesgarani
83
247
0
12 Jul 2017