Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

17 August 2023
Ye-Xin Lu
Yang Ai
Zhenhua Ling

Papers citing "Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement"

28 / 28 papers shown
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ruizhe Cao
Bin Yang
81
74
0
22 Sep 2022
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Andong Li
Shan You
Guochen Yu
C. Zheng
Xiaodong Li
65
28
0
30 Apr 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ruizhe Cao
Sherif Abdulatif
Bin Yang
83
100
0
28 Mar 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
59
35
0
16 Feb 2022
Speech Denoising in the Waveform Domain with Self-Attention
Zhifeng Kong
Wei Ping
Ambrish Dantrey
Bryan Catanzaro
80
63
0
15 Feb 2022
On The Compensation Between Magnitude and Phase in Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
79
74
0
11 Aug 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
117
103
0
27 Apr 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
69
216
0
08 Apr 2021
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Kai Wang
Bengbeng He
Weiping Zhu
88
169
0
18 Mar 2021
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement
Xiang Hao
Xiangdong Su
Radu Horaud
Xiaofei Li
74
200
0
29 Oct 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
108
288
0
28 Jul 2020
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping
Vinay Kothapally
Wei Xia
Shahram Ghorbani
John H. L. Hansen
Wei Xue
Jing-ling Huang
52
25
0
17 Jul 2020
Real Time Speech Enhancement in the Waveform Domain
Alexandre Défossez
Gabriel Synnaeve
Yossi Adi
92
465
0
23 Jun 2020
The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
Ebrahim Beyrami
R. Cheng
...
A. Aazami
Sebastian Braun
Puneet Rana
Sriram Srinivasan
J. Gehrke
96
318
0
16 May 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
229
3,164
0
16 May 2020
ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric
Michael Chinen
Felicia S. C. Lim
Jan Skoglund
Nikita Gureev
F. O'Gorman
Andrew Hines
78
143
0
20 Apr 2020
PHASEN: A Phase-and-Harmonics-Aware Speech Enhancement Network
Dacheng Yin
Chong Luo
Zhiwei Xiong
Wenjun Zeng
68
320
0
12 Nov 2019
T-GSA: Transformer with Gaussian-weighted self-attention for speech enhancement
Jaeyoung Kim
Mostafa El-Khamy
Jungwon Lee
85
189
0
13 Oct 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
67
331
0
13 May 2019
Phase-aware Speech Enhancement with Deep Complex U-Net
Hyeong-Seok Choi
Jang-Hyun Kim
Jaesung Huh
A. Kim
Jung-Woo Ha
Kyogu Lee
75
334
0
07 Mar 2019
SDR - half-baked or well done?
Jonathan Le Roux
Scott Wisdom
Hakan Erdogan
J. Hershey
165
1,205
0
06 Nov 2018
Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Jonathan Le Roux
Gordon Wichern
Shinji Watanabe
Andy M. Sarroff
J. Hershey
61
77
0
02 Oct 2018
Speech Dereverberation Using Fully Convolutional Networks
Ori Ernst
Shlomo E. Chazan
Sharon Gannot
Jacob Goldberger
38
83
0
22 Mar 2018
SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual
Antonio Bonafonte
Joan Serrà
GAN
94
1,148
0
28 Mar 2017
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi
Jose Caballero
Ferenc Huszár
J. Totz
Andrew P. Aitken
Rob Bishop
Daniel Rueckert
Zehan Wang
SupR
342
5,247
0
16 Sep 2016
Instance Normalization: The Missing Ingredient for Fast Stylization
Dmitry Ulyanov
Andrea Vedaldi
Victor Lempitsky
OOD
180
3,714
0
27 Jul 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
435
10,548
0
21 Jul 2016
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
VLM
355
18,661
0
06 Feb 2015