ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.01703
  4. Cited By
Conditional Generative Adversarial Networks for Speech Enhancement and
  Noise-Robust Speaker Verification

Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification

6 September 2017
Daniel Michelsanti
Zheng-Hua Tan
    GAN
ArXivPDFHTML

Papers citing "Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification"

50 / 81 papers shown
Title
Linguistic Knowledge Transfer Learning for Speech Enhancement
Kuo-Hsuan Hung
Xugang Lu
Szu-Wei Fu
Huan-Hsin Tseng
Hsin-Yi Lin
Chii-Wann Lin
Yu Tsao
VLM
70
0
0
10 Mar 2025
A Survey of Deep Learning Audio Generation Methods
A Survey of Deep Learning Audio Generation Methods
Matej Bozic
Marko Horvat
VLM
MedIm
66
0
0
31 May 2024
An Investigation of Incorporating Mamba for Speech Enhancement
An Investigation of Incorporating Mamba for Speech Enhancement
Rong-Yu Chao
Wen-Huang Cheng
Moreno La Quatra
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Szu-Wei Fu
Yu Tsao
Mamba
53
26
0
10 May 2024
Investigating the Design Space of Diffusion Models for Speech
  Enhancement
Investigating the Design Space of Diffusion Models for Speech Enhancement
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
33
6
0
07 Dec 2023
Rethinking Session Variability: Leveraging Session Embeddings for
  Session Robustness in Speaker Verification
Rethinking Session Variability: Leveraging Session Embeddings for Session Robustness in Speaker Verification
Hee-Soo Heo
Ki-hyun Nam
Bong-Jin Lee
Youngki Kwon
Min-Ji Lee
You Jin Kim
Joon Son Chung
32
1
0
26 Sep 2023
Self-supervised learning with diffusion-based multichannel speech
  enhancement for speaker verification under noisy conditions
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
24
1
0
05 Jul 2023
Spatial-temporal Graph Based Multi-channel Speaker Verification With
  Ad-hoc Microphone Arrays
Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays
Yijiang Chen
Chen Liang
Xiao-Lei Zhang
31
1
0
03 Jul 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via
  Distance and Speaker Information
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Jiuxin Lin
Peng Wang
Heinrich Dinkel
Jun Chen
Zhiyong Wu
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
19
8
0
28 Jun 2023
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker
  Extraction
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Jiuxin Lin
X. Cai
Heinrich Dinkel
Jun Chen
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Zhiyong Wu
Yujun Wang
Helen M. Meng
26
21
0
25 Jun 2023
Speech Enhancement with Multi-granularity Vector Quantization
Speech Enhancement with Multi-granularity Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
25
0
0
16 Feb 2023
SkipConvGAN: Monaural Speech Dereverberation using Generative
  Adversarial Networks via Complex Time-Frequency Masking
SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking
Vinay Kothapally
John H. L. Hansen
14
21
0
22 Nov 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method
  Using Variational Autoencoder and Adversarial Training
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
23
5
0
16 Nov 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
29
62
0
22 Sep 2022
Stochastic Restoration of Heavily Compressed Musical Audio using
  Generative Adversarial Networks
Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks
Stefan Lattner
J. Nistal
30
11
0
04 Jul 2022
Model Joins: Enabling Analytics Over Joins of Absent Big Tables
Model Joins: Enabling Analytics Over Joins of Absent Big Tables
A. Shanghooshabad
Peter Triantafillou
9
0
0
21 Jun 2022
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Rong-Yu Chao
Cheng Yu
Szu-Wei Fu
Xugang Lu
Yu Tsao
VLM
31
14
0
31 Mar 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
31
178
0
10 Feb 2022
Unsupervised Noise Adaptive Speech Enhancement by
  Discriminator-Constrained Optimal Transport
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport
Hsin-Yi Lin
Huan-Hsin Tseng
Xugang Lu
Yu Tsao
OT
22
32
0
11 Nov 2021
OSSEM: one-shot speaker adaptive speech enhancement using meta learning
OSSEM: one-shot speaker adaptive speech enhancement using meta learning
Cheng Yu
Szu-Wei Fu
Tsun-An Hsieh
Yu Tsao
Mirco Ravanelli
VLM
42
4
0
10 Nov 2021
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for
  Speech Enhancement Using Sign-Exponent-Only Floating-Points
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Yu-Chen Lin
Cheng Yu
Y. Hsu
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
19
6
0
08 Nov 2021
A Unified View of cGANs with and without Classifiers
A Unified View of cGANs with and without Classifiers
Si-An Chen
Chun-Liang Li
Hsuan-Tien Lin
GAN
22
10
0
01 Nov 2021
Late reverberation suppression using U-nets
Late reverberation suppression using U-nets
D. León
Felipe A. Tobar
30
4
0
05 Oct 2021
Adversarial Data Augmentation for Disordered Speech Recognition
Adversarial Data Augmentation for Disordered Speech Recognition
Zengrui Jin
Mengzhe Geng
Xurong Xie
Jianwei Yu
Shansong Liu
Xunying Liu
Helen Meng
22
35
0
02 Aug 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Yen-Ju Lu
Yu Tsao
Shinji Watanabe
DiffM
13
73
0
25 Jul 2021
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field
  Multi-Channel Speech Enhancement for Video Conferencing
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Wei Rao
Yihui Fu
Yanxin Hu
Xin Xu
Yvkai Jv
...
Shinji Watanabe
Zheng-Hua Tan
Hui Bu
Tao Yu
Shidong Shang
49
12
0
02 Apr 2021
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for
  Text-Dependent Speaker Verification
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification
A. K. Sarkar
Md. Sahidullah
Zheng-Hua Tan
12
0
0
03 Feb 2021
Visual Speech Enhancement Without A Real Visual Stream
Visual Speech Enhancement Without A Real Visual Stream
Sindhu B. Hegde
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
20
17
0
20 Dec 2020
Improving Speech Enhancement Performance by Leveraging Contextual Broad
  Phonetic Class Information
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information
Yen-Ju Lu
Chia-Yu Chang
Cheng Yu
Ching-Feng Liu
J. Hung
Shinji Watanabe
Yu Tsao
27
4
0
15 Nov 2020
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for
  Extremely Low Signal-to-noise Ratio Condition
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition
Xiang Hao
Xiangdong Su
Zhiyu Wang
Hui Zhang
Batushiren
25
32
0
29 Oct 2020
Investigating Cross-Domain Losses for Speech Enhancement
Investigating Cross-Domain Losses for Speech Enhancement
Sherif Abdulatif
Karim Armanious
Jayasankar T. Sajeev
Karim Guirguis
B. Yang
19
7
0
20 Oct 2020
Conditional Image Generation with One-Vs-All Classifier
Conditional Image Generation with One-Vs-All Classifier
Xiangrui Xu
Yaqin Li
Cao Yuan
VLM
GAN
25
12
0
18 Sep 2020
Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation
Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation
Akhil Mathur
F. Kawsar
N. Bianchi-Berthouze
Nicholas D. Lane
24
13
0
06 Sep 2020
Speaker Representation Learning using Global Context Guided Channel and
  Time-Frequency Transformations
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
22
9
0
02 Sep 2020
Improved Lite Audio-Visual Speech Enhancement
Improved Lite Audio-Visual Speech Enhancement
Shang-Yi Chuang
Hsin-Min Wang
Yu Tsao
33
32
0
30 Aug 2020
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning
  Using Generative Adversarial Networks
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks
J. Nistal
Stefan Lattner
G. Richard
GAN
25
55
0
27 Aug 2020
Data augmentation enhanced speaker enrollment for text-dependent speaker
  verification
Data augmentation enhanced speaker enrollment for text-dependent speaker verification
A. K. Sarkar
H. Sarma
Priyanka Dwivedi
Zheng-Hua Tan
8
3
0
12 Jul 2020
Dynamic Attention Based Generative Adversarial Network with Phase
  Post-Processing for Speech Enhancement
Dynamic Attention Based Generative Adversarial Network with Phase Post-Processing for Speech Enhancement
Andong Li
C. Zheng
Renhua Peng
Cunhang Fan
Xiaodong Li
6
4
0
13 Jun 2020
SADDEL: Joint Speech Separation and Denoising Model based on Multitask
  Learning
SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning
Yuan-Kuei Wu
Chao-I Tuan
Hung-yi Lee
Yu Tsao
25
4
0
20 May 2020
Generative Adversarial Networks (GANs Survey): Challenges, Solutions,
  and Future Directions
Generative Adversarial Networks (GANs Survey): Challenges, Solutions, and Future Directions
Divya Saxena
Jiannong Cao
AAML
AI4CE
26
287
0
30 Apr 2020
Data augmentation using generative networks to identify dementia
Data augmentation using generative networks to identify dementia
B. Mirheidari
Yilin Pan
D. Blackburn
R. O'Malley
Traci Walker
A. Venneri
M. Reuber
H. Christensen
MedIm
18
4
0
13 Apr 2020
SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech
  Enhancement
SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement
R. Rehr
Timo Gerkmann
11
15
0
07 Apr 2020
Characterizing Speech Adversarial Examples Using Self-Attention U-Net
  Enhancement
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement
Chao-Han Huck Yang
Jun Qi
Pin-Yu Chen
Xiaoli Ma
Chin-Hui Lee
AAML
32
53
0
31 Mar 2020
Mic2Mic: Using Cycle-Consistent Generative Adversarial Networks to
  Overcome Microphone Variability in Speech Systems
Mic2Mic: Using Cycle-Consistent Generative Adversarial Networks to Overcome Microphone Variability in Speech Systems
Akhil Mathur
Anton Isopoussu
F. Kawsar
N. Bianchi-Berthouze
Nicholas D. Lane
25
51
0
27 Mar 2020
iSEGAN: Improved Speech Enhancement Generative Adversarial Networks
iSEGAN: Improved Speech Enhancement Generative Adversarial Networks
Deepak Baby
GAN
13
7
0
20 Feb 2020
Analysis of Deep Feature Loss based Enhancement for Speaker Verification
Analysis of Deep Feature Loss based Enhancement for Speaker Verification
Saurabh Kataria
P. S. Nidadavolu
Jesús Villalba
Najim Dehak
27
13
0
01 Feb 2020
Speech Enhancement based on Denoising Autoencoder with Multi-branched
  Encoders
Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders
Cheng Yu
Ryandhimas E. Zezario
Syu-Siang Wang
Jonathan Sherman
Yi-Yen Hsieh
Xugang Lu
Hsin-Min Wang
Yu Tsao
27
38
0
06 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
34
81
0
02 Jan 2020
High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram
High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram
Leyuan Sheng
Dong-Yan Huang
Evgeny Nikolaevich Pavlovskiy
19
15
0
03 Dec 2019
Time-Domain Multi-modal Bone/air Conducted Speech Enhancement
Time-Domain Multi-modal Bone/air Conducted Speech Enhancement
Cheng Yu
Kuo-Hsuan Hung
Syu-Siang Wang
Szu-Wei Fu
Yu Tsao
J. Hung
29
33
0
22 Nov 2019
Distributed Microphone Speech Enhancement based on Deep Learning
Distributed Microphone Speech Enhancement based on Deep Learning
Syu-Siang Wang
Yu-You Liang
J. Hung
Yu Tsao
H. Wang
Shih-Hau Fang
22
6
0
19 Nov 2019
12
Next