ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.00369
  4. Cited By
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
v1v2 (latest)

Deep neural network techniques for monaural speech enhancement: state of the art analysis

1 December 2022
P. Ochieng
ArXiv (abs)PDFHTML

Papers citing "Deep neural network techniques for monaural speech enhancement: state of the art analysis"

50 / 125 papers shown
Title
Phase-aware Single-stage Speech Denoising and Dereverberation with U-Net
Phase-aware Single-stage Speech Denoising and Dereverberation with U-Net
Hyeong-Seok Choi
Hoon Heo
Jie Hwan Lee
Kyogu Lee
91
19
0
01 Jun 2020
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
Igor Fedorov
Marko Stamenovic
Carl R. Jensen
Li-Chia Yang
Ari Mandell
Yiming Gan
Matthew Mattina
P. Whatmough
59
98
0
20 May 2020
Vector-Quantized Autoregressive Predictive Coding
Vector-Quantized Autoregressive Predictive Coding
Yu-An Chung
Hao Tang
James R. Glass
SSL
54
115
0
17 May 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
229
3,164
0
16 May 2020
Longformer: The Long-Document Transformer
Longformer: The Long-Document Transformer
Iz Beltagy
Matthew E. Peters
Arman Cohan
RALMVLM
190
4,105
0
10 Apr 2020
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss
Yi Luo
N. Mesgarani
72
29
0
27 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
87
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
112
265
0
20 Feb 2020
End-to-End Multi-speaker Speech Recognition with Transformer
End-to-End Multi-speaker Speech Recognition with Transformer
Xuankai Chang
Wangyou Zhang
Y. Qian
Jonathan Le Roux
Shinji Watanabe
ViT
93
106
0
10 Feb 2020
Improving GANs for Speech Enhancement
Improving GANs for Speech Enhancement
Huy P Phan
Ian Mcloughlin
L. D. Pham
Oliver Y. Chén
P. Koch
M. D. Vos
Alfred Mertins
65
119
0
15 Jan 2020
Reformer: The Efficient Transformer
Reformer: The Efficient Transformer
Nikita Kitaev
Lukasz Kaiser
Anselm Levskaya
VLM
209
2,335
0
13 Jan 2020
Demystifying TasNet: A Dissecting Approach
Demystifying TasNet: A Dissecting Approach
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb-Umbach
50
58
0
20 Nov 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep
  Bidirectional Transformer Encoders
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
155
374
0
25 Oct 2019
A Recurrent Variational Autoencoder for Speech Enhancement
A Recurrent Variational Autoencoder for Speech Enhancement
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
DRL
138
79
0
24 Oct 2019
Two-Step Sound Source Separation: Training on Learned Latent Targets
Two-Step Sound Source Separation: Training on Learned Latent Targets
Efthymios Tzinis
Shrikant Venkataramani
Zhepei Wang
Y. C. Sübakan
Paris Smaragdis
65
65
0
22 Oct 2019
Dual-path RNN: efficient long sequence modeling for time-domain
  single-channel speech separation
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
100
775
0
14 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
107
130
0
03 Sep 2019
Increasing Compactness Of Deep Learning Based Speech Enhancement Models
  With Parameter Pruning And Quantization Techniques
Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques
Jyun-Yi Wu
Cheng Yu
Szu-Wei Fu
Chih-Ting Liu
Shao-Yi Chien
Yu Tsao
33
23
0
31 May 2019
A comprehensive study of speech separation: spectrogram vs waveform
  separation
A comprehensive study of speech separation: spectrogram vs waveform separation
F. Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
Dong Yu
73
81
0
17 May 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores
  Optimization for Speech Enhancement
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
70
331
0
13 May 2019
Universal Sound Separation
Universal Sound Separation
Ilya Kavalerov
Scott Wisdom
Hakan Erdogan
Brian Patton
K. Wilson
Jonathan Le Roux
J. Hershey
81
187
0
08 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural
  Speaker Separation
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation
Yuzhou Liu
DeLiang Wang
80
158
0
25 Apr 2019
Towards Generalized Speech Enhancement with Generative Adversarial
  Networks
Towards Generalized Speech Enhancement with Generative Adversarial Networks
Santiago Pascual
Joan Serrà
Antonio Bonafonte
GAN
64
33
0
06 Apr 2019
An Unsupervised Autoregressive Model for Speech Representation Learning
An Unsupervised Autoregressive Model for Speech Representation Learning
Yu-An Chung
Wei-Ning Hsu
Hao Tang
James R. Glass
SSL
94
409
0
05 Apr 2019
Recursive speech separation for unknown number of speakers
Recursive speech separation for unknown number of speakers
Naoya Takahashi
Sudarsanam Parthasaarathy
Nabarun Goswami
Yuki Mitsufuji
58
81
0
05 Apr 2019
Speech enhancement with variational autoencoders and alpha-stable
  distributions
Speech enhancement with variational autoencoders and alpha-stable distributions
Simon Leglaive
Umut Simsekli
Antoine Liutkus
Laurent Girin
Radu Horaud
DRL
55
36
0
08 Feb 2019
A variance modeling framework based on variational autoencoders for
  speech enhancement
A variance modeling framework based on variational autoencoders for speech enhancement
Simon Leglaive
Laurent Girin
Radu Horaud
DRL
63
91
0
05 Feb 2019
Deep Learning Based Phase Reconstruction for Speaker Separation: A
  Trigonometric Perspective
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Zhong-Qiu Wang
Ke Tan
DeLiang Wang
107
95
0
22 Nov 2018
SDR - half-baked or well done?
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
165
1,205
0
06 Nov 2018
End-to-end music source separation: is it possible in the waveform
  domain?
End-to-end music source separation: is it possible in the waveform domain?
Francesc Lluís
Jordi Pons
Xavier Serra
74
73
0
29 Oct 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned
  Spectrogram Masking
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
96
370
0
11 Oct 2018
Phasebook and Friends: Leveraging Discrete Representations for Source
  Separation
Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Jonathan Le Roux
Gordon Wichern
Shinji Watanabe
Andy M. Sarroff
J. Hershey
64
77
0
02 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
177
1,796
0
20 Sep 2018
A study on speech enhancement using exponent-only floating point
  quantized neural network (EOFP-QNN)
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y. Hsu
Yu-Chen Lin
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
MQ
42
15
0
17 Aug 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel
  Speech Mixtures
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
Jun Wang
Jie Chen
Dan Su
Lianwu Chen
Meng Yu
Y. Qian
Dong Yu
90
91
0
24 Jul 2018
Noise Adaptive Speech Enhancement using Domain Adversarial Training
Noise Adaptive Speech Enhancement using Domain Adversarial Training
Chien-Feng Liao
Yu Tsao
Hung-yi Lee
H. Wang
60
52
0
19 Jul 2018
Speech Denoising with Deep Feature Losses
Speech Denoising with Deep Feature Losses
François Germain
Qifeng Chen
V. Koltun
78
162
0
27 Jun 2018
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source
  Separation
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
Daniel Stoller
Sebastian Ewert
S. Dixon
AI4TS
146
598
0
08 Jun 2018
Noise2Noise: Learning Image Restoration without Clean Data
Noise2Noise: Learning Image Restoration without Clean Data
J. Lehtinen
Jacob Munkberg
J. Hasselgren
S. Laine
Tero Karras
M. Aittala
Timo Aila
109
1,610
0
12 Mar 2018
An Empirical Evaluation of Generic Convolutional and Recurrent Networks
  for Sequence Modeling
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
DRL
101
4,857
0
04 Mar 2018
CSRNet: Dilated Convolutional Neural Networks for Understanding the
  Highly Congested Scenes
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Yuhong Li
Xiaofan Zhang
Deming Chen
169
1,343
0
27 Feb 2018
Monaural Speech Enhancement using Deep Neural Networks by Maximizing a
  Short-Time Objective Intelligibility Measure
Monaural Speech Enhancement using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure
Morten Kolbæk
Zheng-Hua Tan
Jesper Jensen
67
61
0
02 Feb 2018
Language and Noise Transfer in Speech Enhancement Generative Adversarial
  Network
Language and Noise Transfer in Speech Enhancement Generative Adversarial Network
Santiago Pascual
Maruchan Park
Joan Serrà
Antonio Bonafonte
K. Ahn
64
28
0
18 Dec 2017
Exploring Speech Enhancement with Generative Adversarial Networks for
  Robust Speech Recognition
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
Chris Donahue
Yue Liu
Rohit Prabhavalkar
58
201
0
15 Nov 2017
TasNet: time-domain audio separation network for real-time,
  single-channel speech separation
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
94
633
0
01 Nov 2017
Statistical Speech Enhancement Based on Probabilistic Integration of
  Variational Autoencoder and Non-Negative Matrix Factorization
Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Yoshiaki Bando
Masato Mimura
Katsutoshi Itoyama
Kazuyoshi Yoshii
Tatsuya Kawahara
82
120
0
31 Oct 2017
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics
  Optimization by Fully Convolutional Neural Networks
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks
Szu-Wei Fu
Tao-Wei Wang
Yu Tsao
Xugang Lu
Hisashi Kawai
86
276
0
12 Sep 2017
Integrated Speech Enhancement Method Based on Weighted Prediction Error
  and DNN for Dereverberation and Denoising
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising
Hao Li
Xueliang Zhang
Hui Zhang
Guanglai Gao
18
7
0
28 Aug 2017
Supervised Speech Separation Based on Deep Learning: An Overview
Supervised Speech Separation Based on Deep Learning: An Overview
DeLiang Wang
Jitong Chen
SSL
94
1,376
0
24 Aug 2017
Speaker-independent Speech Separation with Deep Attractor Network
Speaker-independent Speech Separation with Deep Attractor Network
Yi Luo
Zhuo Chen
N. Mesgarani
83
247
0
12 Jul 2017
Previous
123
Next