Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.01703
Cited By
Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification
6 September 2017
Daniel Michelsanti
Zheng-Hua Tan
GAN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification"
50 / 81 papers shown
Title
Linguistic Knowledge Transfer Learning for Speech Enhancement
Kuo-Hsuan Hung
Xugang Lu
Szu-Wei Fu
Huan-Hsin Tseng
Hsin-Yi Lin
Chii-Wann Lin
Yu Tsao
VLM
70
0
0
10 Mar 2025
A Survey of Deep Learning Audio Generation Methods
Matej Bozic
Marko Horvat
VLM
MedIm
66
0
0
31 May 2024
An Investigation of Incorporating Mamba for Speech Enhancement
Rong-Yu Chao
Wen-Huang Cheng
Moreno La Quatra
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Szu-Wei Fu
Yu Tsao
Mamba
53
26
0
10 May 2024
Investigating the Design Space of Diffusion Models for Speech Enhancement
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
33
6
0
07 Dec 2023
Rethinking Session Variability: Leveraging Session Embeddings for Session Robustness in Speaker Verification
Hee-Soo Heo
Ki-hyun Nam
Bong-Jin Lee
Youngki Kwon
Min-Ji Lee
You Jin Kim
Joon Son Chung
32
1
0
26 Sep 2023
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
24
1
0
05 Jul 2023
Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays
Yijiang Chen
Chen Liang
Xiao-Lei Zhang
31
1
0
03 Jul 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Jiuxin Lin
Peng Wang
Heinrich Dinkel
Jun Chen
Zhiyong Wu
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
19
8
0
28 Jun 2023
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Jiuxin Lin
X. Cai
Heinrich Dinkel
Jun Chen
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Zhiyong Wu
Yujun Wang
Helen M. Meng
26
21
0
25 Jun 2023
Speech Enhancement with Multi-granularity Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
25
0
0
16 Feb 2023
SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking
Vinay Kothapally
John H. L. Hansen
14
21
0
22 Nov 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
23
5
0
16 Nov 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
29
62
0
22 Sep 2022
Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks
Stefan Lattner
J. Nistal
30
11
0
04 Jul 2022
Model Joins: Enabling Analytics Over Joins of Absent Big Tables
A. Shanghooshabad
Peter Triantafillou
9
0
0
21 Jun 2022
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Rong-Yu Chao
Cheng Yu
Szu-Wei Fu
Xugang Lu
Yu Tsao
VLM
31
14
0
31 Mar 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
31
178
0
10 Feb 2022
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport
Hsin-Yi Lin
Huan-Hsin Tseng
Xugang Lu
Yu Tsao
OT
22
32
0
11 Nov 2021
OSSEM: one-shot speaker adaptive speech enhancement using meta learning
Cheng Yu
Szu-Wei Fu
Tsun-An Hsieh
Yu Tsao
Mirco Ravanelli
VLM
42
4
0
10 Nov 2021
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Yu-Chen Lin
Cheng Yu
Y. Hsu
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
19
6
0
08 Nov 2021
A Unified View of cGANs with and without Classifiers
Si-An Chen
Chun-Liang Li
Hsuan-Tien Lin
GAN
22
10
0
01 Nov 2021
Late reverberation suppression using U-nets
D. León
Felipe A. Tobar
30
4
0
05 Oct 2021
Adversarial Data Augmentation for Disordered Speech Recognition
Zengrui Jin
Mengzhe Geng
Xurong Xie
Jianwei Yu
Shansong Liu
Xunying Liu
Helen Meng
22
35
0
02 Aug 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Yen-Ju Lu
Yu Tsao
Shinji Watanabe
DiffM
13
73
0
25 Jul 2021
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Wei Rao
Yihui Fu
Yanxin Hu
Xin Xu
Yvkai Jv
...
Shinji Watanabe
Zheng-Hua Tan
Hui Bu
Tao Yu
Shidong Shang
49
12
0
02 Apr 2021
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification
A. K. Sarkar
Md. Sahidullah
Zheng-Hua Tan
12
0
0
03 Feb 2021
Visual Speech Enhancement Without A Real Visual Stream
Sindhu B. Hegde
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
20
17
0
20 Dec 2020
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information
Yen-Ju Lu
Chia-Yu Chang
Cheng Yu
Ching-Feng Liu
J. Hung
Shinji Watanabe
Yu Tsao
27
4
0
15 Nov 2020
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition
Xiang Hao
Xiangdong Su
Zhiyu Wang
Hui Zhang
Batushiren
25
32
0
29 Oct 2020
Investigating Cross-Domain Losses for Speech Enhancement
Sherif Abdulatif
Karim Armanious
Jayasankar T. Sajeev
Karim Guirguis
B. Yang
19
7
0
20 Oct 2020
Conditional Image Generation with One-Vs-All Classifier
Xiangrui Xu
Yaqin Li
Cao Yuan
VLM
GAN
25
12
0
18 Sep 2020
Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation
Akhil Mathur
F. Kawsar
N. Bianchi-Berthouze
Nicholas D. Lane
24
13
0
06 Sep 2020
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
22
9
0
02 Sep 2020
Improved Lite Audio-Visual Speech Enhancement
Shang-Yi Chuang
Hsin-Min Wang
Yu Tsao
33
32
0
30 Aug 2020
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks
J. Nistal
Stefan Lattner
G. Richard
GAN
25
55
0
27 Aug 2020
Data augmentation enhanced speaker enrollment for text-dependent speaker verification
A. K. Sarkar
H. Sarma
Priyanka Dwivedi
Zheng-Hua Tan
8
3
0
12 Jul 2020
Dynamic Attention Based Generative Adversarial Network with Phase Post-Processing for Speech Enhancement
Andong Li
C. Zheng
Renhua Peng
Cunhang Fan
Xiaodong Li
6
4
0
13 Jun 2020
SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning
Yuan-Kuei Wu
Chao-I Tuan
Hung-yi Lee
Yu Tsao
25
4
0
20 May 2020
Generative Adversarial Networks (GANs Survey): Challenges, Solutions, and Future Directions
Divya Saxena
Jiannong Cao
AAML
AI4CE
26
287
0
30 Apr 2020
Data augmentation using generative networks to identify dementia
B. Mirheidari
Yilin Pan
D. Blackburn
R. O'Malley
Traci Walker
A. Venneri
M. Reuber
H. Christensen
MedIm
18
4
0
13 Apr 2020
SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement
R. Rehr
Timo Gerkmann
11
15
0
07 Apr 2020
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement
Chao-Han Huck Yang
Jun Qi
Pin-Yu Chen
Xiaoli Ma
Chin-Hui Lee
AAML
32
53
0
31 Mar 2020
Mic2Mic: Using Cycle-Consistent Generative Adversarial Networks to Overcome Microphone Variability in Speech Systems
Akhil Mathur
Anton Isopoussu
F. Kawsar
N. Bianchi-Berthouze
Nicholas D. Lane
25
51
0
27 Mar 2020
iSEGAN: Improved Speech Enhancement Generative Adversarial Networks
Deepak Baby
GAN
13
7
0
20 Feb 2020
Analysis of Deep Feature Loss based Enhancement for Speaker Verification
Saurabh Kataria
P. S. Nidadavolu
Jesús Villalba
Najim Dehak
27
13
0
01 Feb 2020
Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders
Cheng Yu
Ryandhimas E. Zezario
Syu-Siang Wang
Jonathan Sherman
Yi-Yen Hsieh
Xugang Lu
Hsin-Min Wang
Yu Tsao
27
38
0
06 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
34
81
0
02 Jan 2020
High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram
Leyuan Sheng
Dong-Yan Huang
Evgeny Nikolaevich Pavlovskiy
19
15
0
03 Dec 2019
Time-Domain Multi-modal Bone/air Conducted Speech Enhancement
Cheng Yu
Kuo-Hsuan Hung
Syu-Siang Wang
Szu-Wei Fu
Yu Tsao
J. Hung
29
33
0
22 Nov 2019
Distributed Microphone Speech Enhancement based on Deep Learning
Syu-Siang Wang
Yu-You Liang
J. Hung
Yu Tsao
H. Wang
Shih-Hau Fang
22
6
0
19 Nov 2019
1
2
Next