ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.03538
  4. Cited By
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement

8 April 2021
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
ArXivPDFHTML

Papers citing "MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement"

29 / 29 papers shown
Title
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement
Jae-Sung Bae
Anastasia Kuznetsova
Dinesh Manocha
John Hershey
Trausti Kristjansson
Minje Kim
77
0
0
23 Jan 2025
Complex Image-Generative Diffusion Transformer for Audio Denoising
Complex Image-Generative Diffusion Transformer for Audio Denoising
Junhui Li
Pu Wang
Jialu Li
Youshan Zhang
DiffM
24
1
0
13 Jun 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang
Niki Trigoni
Andrew Markham
37
3
0
11 Jun 2024
Textless Acoustic Model with Self-Supervised Distillation for
  Noise-Robust Expressive Speech-to-Speech Translation
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Min-Jae Hwang
Ilia Kulikov
Benjamin Peloquin
Hongyu Gong
Peng-Jen Chen
Ann Lee
35
1
0
04 Jun 2024
An Investigation of Incorporating Mamba for Speech Enhancement
An Investigation of Incorporating Mamba for Speech Enhancement
Rong-Yu Chao
Wen-Huang Cheng
Moreno La Quatra
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Szu-Wei Fu
Yu Tsao
Mamba
53
25
0
10 May 2024
BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task
  Learning Framework and Multi-discriminators
BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators
Zihan Zhang
Jiayao Sun
Xianjun Xia
Chuanzeng Huang
Yijian Xiao
Lei Xie
23
3
0
08 Jan 2024
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
E. Chng
DiffM
16
10
0
16 Jul 2023
Diffusion-Based Speech Enhancement with Joint Generative and Predictive
  Decoders
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi
Kazuki Shimada
M. Hirano
Takashi Shibuya
Yuichiro Koyama
Zhi-Wei Zhong
Shusuke Takahashi
Tatsuya Kawahara
Yuki Mitsufuji
DiffM
30
15
0
18 May 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
Integrating Uncertainty into Neural Network-based Speech Enhancement
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
32
2
0
15 May 2023
Perceive and predict: self-supervised speech representation based loss
  functions for speech enhancement
Perceive and predict: self-supervised speech representation based loss functions for speech enhancement
George Close
William Ravenscroft
Thomas Hain
Stefan Goetze
SSL
35
12
0
11 Jan 2023
Audio Denoising for Robust Audio Fingerprinting
Audio Denoising for Robust Audio Fingerprinting
Kamil Akesbi
21
3
0
21 Dec 2022
BASPRO: a balanced script producer for speech corpus collection based on
  the genetic algorithm
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm
Yu-Wen Chen
Hsin-Min Wang
Yu Tsao
19
0
0
11 Dec 2022
SCP-GAN: Self-Correcting Discriminator Optimization for Training
  Consistency Preserving Metric GAN on Speech Enhancement Tasks
SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks
Vasily Zadorozhnyy
Qian Ye
K. Koishida
21
8
0
26 Oct 2022
Improved Normalizing Flow-Based Speech Enhancement using an All-pole
  Gammatone Filterbank for Conditional Input Representation
Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation
Martin Strauss
Matteo Torcoli
B. Edler
21
4
0
21 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative
  Models
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
16
180
0
11 Aug 2022
Speaker Reinforcement Using Target Source Extraction for Robust
  Automatic Speech Recognition
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Catalin Zorila
R. Doddipatla
24
11
0
09 May 2022
FFC-SE: Fast Fourier Convolution for Speech Enhancement
FFC-SE: Fast Fourier Convolution for Speech Enhancement
Ivan Shchekotov
Pavel Andreev
Oleg Ivanov
Aibek Alanov
Dmitry Vetrov
26
23
0
06 Apr 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
21
91
0
28 Mar 2022
HiFi++: a Unified Framework for Bandwidth Extension and Speech
  Enhancement
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement
Pavel Andreev
Aibek Alanov
Oleg Ivanov
Dmitry Vetrov
36
38
0
24 Mar 2022
MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data
MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data
George Close
Thomas Hain
Stefan Goetze
24
9
0
23 Mar 2022
MANNER: Multi-view Attention Network for Noise Erasure
MANNER: Multi-view Attention Network for Noise Erasure
Hyun Joon Park
Byung Ha Kang
Wooseok Shin
Jin Sob Kim
S. W. Han
30
48
0
04 Mar 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent
  Architecture for Acoustic Signal Enhancement
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
21
3
0
24 Jan 2022
Perceptual Loss with Recognition Model for Single-Channel Enhancement
  and Robust ASR
Perceptual Loss with Recognition Model for Single-Channel Enhancement and Robust ASR
Peter William VanHarn Plantinga
Deblin Bagchi
Eric Fosler-Lussier
46
10
0
11 Dec 2021
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment
  Model with Cross-Domain Features
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
DiffM
28
75
0
03 Nov 2021
Self-Supervised Speech Denoising Using Only Noisy Audio Signals
Self-Supervised Speech Denoising Using Only Noisy Audio Signals
Jiasong Wu
Qingchun Li
Guanyu Yang
Lei Li
L. Senhadji
H. Shu
19
10
0
30 Oct 2021
Toward Degradation-Robust Voice Conversion
Toward Degradation-Robust Voice Conversion
Chien-yu Huang
Kai-Wei Chang
Hung-yi Lee
30
7
0
14 Oct 2021
Dual-branch Attention-In-Attention Transformer for single-channel speech
  enhancement
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Guochen Yu
Andong Li
C. Zheng
Yinuo Guo
Yutian Wang
Hui Wang
35
84
0
13 Oct 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only
  on noisy/ reverberated speech
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
38
46
0
12 Oct 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel
  Speech Enhancement
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
19
141
0
22 Jun 2021
1