ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.09452
  4. Cited By
SEGAN: Speech Enhancement Generative Adversarial Network

SEGAN: Speech Enhancement Generative Adversarial Network

28 March 2017
Santiago Pascual
Antonio Bonafonte
Joan Serrà
    GAN
ArXivPDFHTML

Papers citing "SEGAN: Speech Enhancement Generative Adversarial Network"

50 / 155 papers shown
Title
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Ziling Huang
Haixin Guan
Yanhua Long
19
0
0
18 May 2025
DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning
DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning
Tom Dooney
Harsh Narola
Stefano Bromuri
R. L. Curier
C. Broeck
Sarah Caudill
D. Tan
74
0
0
30 Jan 2025
Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
Yuewei Zhang
Huanbin Zou
Jie Zhu
44
0
0
21 Jan 2025
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning
Sjoerd Groot
Qinyu Chen
Jan C. van Gemert
Chang Gao
Mamba
213
0
0
14 Oct 2024
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
30
0
0
04 Oct 2024
aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on Raw Audio
aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on Raw Audio
Yan Ru Pei
Ritik Shrivastava
FNU Sidharth
45
1
0
05 Sep 2024
Effects of Dataset Sampling Rate for Noise Cancellation through Deep
  Learning
Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
Brandon Colelough
Andrew Zheng
28
1
0
30 May 2024
Towards Decoupling Frontend Enhancement and Backend Recognition in
  Monaural Robust ASR
Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
46
4
0
11 Mar 2024
Single-channel speech enhancement using learnable loss mixup
Single-channel speech enhancement using learnable loss mixup
Oscar Chang
Dung N. Tran
K. Koishida
51
7
0
20 Dec 2023
PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch
  Interactions for Multi-Channel Speech Enhancement
PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch Interactions for Multi-Channel Speech Enhancement
Jia Pan
Shulin He
Tianci Wu
Hui Zhang
Xueliang Zhang
29
0
0
19 Sep 2023
Single-Channel Speech Enhancement with Deep Complex U-Networks and
  Probabilistic Latent Space Models
Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models
E. J. Nustede
Jörn Anemüller
27
3
0
04 Sep 2023
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Wen Wang
Dongchao Yang
Qichen Ye
Bowen Cao
Yuexian Zou
DiffM
40
3
0
03 Sep 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
DiffM
19
10
0
16 Jul 2023
Incorporating Ultrasound Tongue Images for Audio-Visual Speech
  Enhancement through Knowledge Distillation
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation
Ruixin Zheng
Yang Ai
Zhenhua Ling
32
8
0
24 May 2023
Diffusion-Based Speech Enhancement with Joint Generative and Predictive
  Decoders
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi
Kazuki Shimada
M. Hirano
Takashi Shibuya
Yuichiro Koyama
Zhi-Wei Zhong
Shusuke Takahashi
Tatsuya Kawahara
Yuki Mitsufuji
DiffM
40
15
0
18 May 2023
Enhancing Gappy Speech Audio Signals with Generative Adversarial
  Networks
Enhancing Gappy Speech Audio Signals with Generative Adversarial Networks
Deniss Strods
Alan F. Smeaton
27
2
0
09 May 2023
Affective social anthropomorphic intelligent system
Affective social anthropomorphic intelligent system
Md. Adyelullahil Mamun
Hasnat Md. Abdullah
Md. Golam Rabiul Alam
Muhammad Mehedi Hassan
Md. Zia Uddin
22
1
0
19 Apr 2023
Time-domain Speech Enhancement Assisted by Multi-resolution Frequency
  Encoder and Decoder
Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Hao Shi
Masato Mimura
Longbiao Wang
J. Dang
Tatsuya Kawahara
36
14
0
26 Mar 2023
Unsupervised Noise adaptation using Data Simulation
Unsupervised Noise adaptation using Data Simulation
Chen Chen
Yuchen Hu
Heqing Zou
Linhui Sun
Chng Eng Siong
36
13
0
23 Feb 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust
  Speech Recognition
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Chen Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
37
15
0
22 Feb 2023
Speech Enhancement with Multi-granularity Vector Quantization
Speech Enhancement with Multi-granularity Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
25
0
0
16 Feb 2023
GANravel: User-Driven Direction Disentanglement in Generative
  Adversarial Networks
GANravel: User-Driven Direction Disentanglement in Generative Adversarial Networks
Noyan Evirgen
Xiang Chen
30
12
0
31 Jan 2023
Audio Denoising for Robust Audio Fingerprinting
Audio Denoising for Robust Audio Fingerprinting
Kamil Akesbi
26
3
0
21 Dec 2022
Generative Models for Improved Naturalness, Intelligibility, and Voicing
  of Whispered Speech
Generative Models for Improved Naturalness, Intelligibility, and Voicing of Whispered Speech
Dominik Wagner
Sebastian P. Bayerl
H. A. C. Maruri
Tobias Bocklet
24
7
0
04 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
40
21
0
01 Dec 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method
  Using Variational Autoencoder and Adversarial Training
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
25
5
0
16 Nov 2022
The Potential of Neural Speech Synthesis-based Data Augmentation for
  Personalized Speech Enhancement
The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Anastasia Kuznetsova
Aswin Sivaraman
Minje Kim
32
3
0
14 Nov 2022
Fast and efficient speech enhancement with variational autoencoders
Fast and efficient speech enhancement with variational autoencoders
M. Sadeghi
Romain Serizel
DRL
BDL
16
2
0
02 Nov 2022
A weighted-variance variational autoencoder model for speech enhancement
A weighted-variance variational autoencoder model for speech enhancement
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
33
1
0
02 Nov 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
112
17
0
30 Oct 2022
GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion
  Causality for Speech Emotion Recognition
GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition
Jiaxin Ye
Xin-Cheng Wen
Xihuai Wang
Yong Xu
Yan Luo
Chang-Li Wu
Liyan Chen
Kunhong Liu
31
35
0
28 Oct 2022
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech
  Enhancement
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata
Naoki Murata
Yuhta Takida
Toshimitsu Uesaka
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
DiffM
36
15
0
27 Oct 2022
SCP-GAN: Self-Correcting Discriminator Optimization for Training
  Consistency Preserving Metric GAN on Speech Enhancement Tasks
SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks
Vasily Zadorozhnyy
Qian Ye
K. Koishida
21
9
0
26 Oct 2022
Adversarial Permutation Invariant Training for Universal Sound
  Separation
Adversarial Permutation Invariant Training for Universal Sound Separation
Emilian Postolache
Jordi Pons
Santiago Pascual
Joan Serrà
VLM
28
6
0
21 Oct 2022
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector
  Quantization
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
41
4
0
28 Sep 2022
Music Separation Enhancement with Generative Modeling
Music Separation Enhancement with Generative Modeling
N. Schaffer
Boaz Cogan
Ethan Manilow
Max Morrison
Prem Seetharaman
Bryan Pardo
34
9
0
26 Aug 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative
  Models
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
24
185
0
11 Aug 2022
Inference skipping for more efficient real-time speech enhancement with
  parallel RNNs
Inference skipping for more efficient real-time speech enhancement with parallel RNNs
Xiaohuai Le
Tong Lei
Kai-Jyun Chen
Jing Lu
38
20
0
22 Jul 2022
Stochastic Restoration of Heavily Compressed Musical Audio using
  Generative Adversarial Networks
Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks
Stefan Lattner
J. Nistal
32
11
0
04 Jul 2022
ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech
  Enhancement
ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement
Ishan Chatterjee
Maruchi Kim
V. Jayaram
Shyamnath Gollakota
Ira Kemelmacher-Shlizerman
Shwetak N. Patel
S. M. Seitz
27
24
0
27 Jun 2022
Does a PESQNet (Loss) Require a Clean Reference Input? The Original PESQ
  Does, But ACR Listening Tests Don't
Does a PESQNet (Loss) Require a Clean Reference Input? The Original PESQ Does, But ACR Listening Tests Don't
Ziyi Xu
Maximilian Strake
Tim Fingscheidt
27
3
0
04 May 2022
Efficient dynamic filter for robust and low computational feature
  extraction
Efficient dynamic filter for robust and low computational feature extraction
Donghyeon Kim
Gwantae Kim
Bokyeung Lee
Jeong-gi Kwak
D. Han
Hanseok Ko
39
3
0
03 May 2022
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural
  Speech Enhancement
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Andong Li
Shan You
Guochen Yu
C. Zheng
Xiaodong Li
38
26
0
30 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation
  System
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
27
20
0
14 Apr 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
Haohe Liu
Xubo Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
21
51
0
12 Apr 2022
FFC-SE: Fast Fourier Convolution for Speech Enhancement
FFC-SE: Fast Fourier Convolution for Speech Enhancement
Ivan Shchekotov
Pavel Andreev
Oleg Ivanov
Aibek Alanov
Dmitry Vetrov
40
23
0
06 Apr 2022
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement
  by Re-Synthesis
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis
Karren D. Yang
Dejan Marković
Steven Krenn
Vasu Agrawal
Alexander Richard
VGen
20
32
0
31 Mar 2022
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
33
110
0
31 Mar 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
26
92
0
28 Mar 2022
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition
Hung-Shin Lee
Pin-Yuan Chen
Yao-Fei Cheng
Yu Tsao
Hsin-Min Wang
27
1
0
25 Mar 2022
1234
Next