ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.00369
  4. Cited By
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
v1v2 (latest)

Deep neural network techniques for monaural speech enhancement: state of the art analysis

1 December 2022
P. Ochieng
ArXiv (abs)PDFHTML

Papers citing "Deep neural network techniques for monaural speech enhancement: state of the art analysis"

50 / 125 papers shown
Title
Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
Vinay Kothapally
John H. L. Hansen
46
9
0
22 Nov 2022
SkipConvGAN: Monaural Speech Dereverberation using Generative
  Adversarial Networks via Complex Time-Frequency Masking
SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking
Vinay Kothapally
John H. L. Hansen
31
23
0
22 Nov 2022
Self-Supervised Learning for Speech Enhancement through Synthesis
Self-Supervised Learning for Speech Enhancement through Synthesis
Bryce Irvin
Marko Stamenovic
M. Kegler
Li-Chia Yang
78
21
0
04 Nov 2022
A Training and Inference Strategy Using Noisy and Enhanced Speech as
  Target for Speech Enhancement without Clean Speech
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
Li-Wei Chen
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
52
3
0
27 Oct 2022
Understanding Diffusion Models: A Unified Perspective
Understanding Diffusion Models: A Unified Perspective
Calvin Luo
DiffM
102
347
0
25 Aug 2022
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech
  Separation
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation
Jian Luo
Jianzong Wang
Ning Cheng
Edward Xiao
Xulong Zhang
Jing Xiao
ViT
71
12
0
28 Jun 2022
Efficient Transformer-based Speech Enhancement Using Long Frames and
  STFT Magnitudes
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Danilo de Oliveira
Tal Peer
Timo Gerkmann
53
21
0
23 Jun 2022
Resource-Efficient Separation Transformer
Resource-Efficient Separation Transformer
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
89
18
0
19 Jun 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time
  Dereverberation Targets
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
J. Valin
Ritwik Giri
Shrikant Venkataramani
Umut Isik
A. Krishnaswamy
35
2
0
16 Jun 2022
SepIt: Approaching a Single Channel Speech Separation Bound
SepIt: Approaching a Single Channel Speech Separation Bound
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
131
27
0
24 May 2022
Ultra Fast Speech Separation Model with Teacher Student Learning
Ultra Fast Speech Separation Model with Teacher Student Learning
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Takuya Yoshioka
Shujie Liu
Jinyu Li
Xiangzhan Yu
70
14
0
27 Apr 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
Haohe Liu
Xubo Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
71
59
0
12 Apr 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
83
100
0
28 Mar 2022
RemixIT: Continual self-training of speech enhancement models via
  bootstrapped remixing
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
75
54
0
17 Feb 2022
Speech Denoising in the Waveform Domain with Self-Attention
Speech Denoising in the Waveform Domain with Self-Attention
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
80
63
0
15 Feb 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
70
190
0
10 Feb 2022
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation
  Invariant Training
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training
Ertuğ Karamatlı
S. Kırbız
SSL
82
10
0
08 Feb 2022
Audio representations for deep learning in sound synthesis: A review
Audio representations for deep learning in sound synthesis: A review
Anastasia Natsiou
Seán O'Leary
AI4TS
57
18
0
07 Jan 2022
Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Yi Li
Yang Sun
K. Horoshenkov
S. M. Naqvi
46
24
0
09 Dec 2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional
  Neural Network
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
86
51
0
04 Dec 2021
Unsupervised Speech Enhancement with speech recognition embedding and
  disentanglement losses
Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
V. Trinh
Sebastian Braun
55
19
0
16 Nov 2021
Monaural source separation: From anechoic to reverberant environments
Monaural source separation: From anechoic to reverberant environments
Tobias Cord-Landwehr
Christoph Boeddeker
Thilo von Neumann
Catalin Zorila
R. Doddipatla
Reinhold Haeb-Umbach
50
31
0
15 Nov 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only
  on noisy/ reverberated speech
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
93
46
0
12 Oct 2021
Late reverberation suppression using U-nets
Late reverberation suppression using U-nets
D. León
Felipe A. Tobar
135
4
0
05 Oct 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Yen-Ju Lu
Yu Tsao
Shinji Watanabe
DiffM
60
74
0
25 Jul 2021
A Simultaneous Denoising and Dereverberation Framework with Target
  Decoupling
A Simultaneous Denoising and Dereverberation Framework with Target Decoupling
Andong Li
Wenzhe Liu
Xiaoxue Luo
Guochen Yu
C. Zheng
Xiaodong Li
75
60
0
24 Jun 2021
Unsupervised Speech Enhancement using Dynamical Variational
  Auto-Encoders
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders
Xiaoyu Bie
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
DiffM
98
55
0
23 Jun 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech
  Separation
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
52
22
0
15 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
188
3,004
0
14 Jun 2021
Training Speech Enhancement Systems with Noisy Speech Datasets
Training Speech Enhancement Systems with Noisy Speech Datasets
Koichi Saito
Stefan Uhlich
Giorgio Fabbro
Yuki Mitsufuji
71
11
0
26 May 2021
Many-Speakers Single Channel Speech Separation with Optimal Permutation
  Training
Many-Speakers Single Channel Speech Separation with Optimal Permutation Training
Shaked Dovrat
Eliya Nachmani
Lior Wolf
VLM
78
22
0
18 Apr 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
72
9
0
30 Mar 2021
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement
  in the Time Domain
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Kai Wang
Bengbeng He
Weiping Zhu
99
169
0
18 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For
  Time-Domain Speech Separation
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
AI4TS
117
49
0
01 Mar 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive
  Locally Recurrent Networks
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
89
29
0
13 Jan 2021
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust
  Waveform Generation
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation
Yang Ai
Haoyu Li
Xin Wang
Junichi Yamagishi
Zhenhua Ling
42
4
0
08 Nov 2020
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to
  evaluate Noise Suppressors
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
98
315
0
28 Oct 2020
Attention is All You Need in Speech Separation
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
99
565
0
25 Oct 2020
Towards Listening to 10 People Simultaneously: An Efficient Permutation
  Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm
Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm
Hideyuki Tachibana
67
14
0
22 Oct 2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High
  Fidelity Speech Synthesis
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
179
1,952
0
12 Oct 2020
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings,
  Semi-Supervised Conversational Data, and Biased Loss
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Umut Isik
Ritwik Giri
Neerad Phansalkar
J. Valin
Karim Helwani
A. Krishnaswamy
69
84
0
11 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for
  End-to-End Monaural Speech Separation
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
112
289
0
28 Jul 2020
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Efthymios Tzinis
Zhepei Wang
Paris Smaragdis
96
130
0
14 Jul 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for
  Speech
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
148
359
0
12 Jul 2020
Real Time Speech Enhancement in the Waveform Domain
Real Time Speech Enhancement in the Waveform Domain
Alexandre Défossez
Gabriel Synnaeve
Yossi Adi
95
465
0
23 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
305
5,853
0
20 Jun 2020
HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech
  Deep Features in Adversarial Networks
HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Jiaqi Su
Zeyu Jin
Adam Finkelstein
69
139
0
10 Jun 2020
Knowledge Distillation: A Survey
Knowledge Distillation: A Survey
Jianping Gou
B. Yu
Stephen J. Maybank
Dacheng Tao
VLM
210
2,993
0
09 Jun 2020
Linformer: Self-Attention with Linear Complexity
Linformer: Self-Attention with Linear Complexity
Sinong Wang
Belinda Z. Li
Madian Khabsa
Han Fang
Hao Ma
219
1,719
0
08 Jun 2020
Multi-talker ASR for an unknown number of sources: Joint training of
  source counting, separation and ASR
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Thilo von Neumann
Christoph Boeddeker
Lukas Drude
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
72
41
0
04 Jun 2020
123
Next