ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.08504
  4. Cited By
Complex spectrogram enhancement by convolutional neural network with
  multi-metrics learning

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning

27 April 2017
Szu-Wei Fu
Ting-Yao Hu
Yu Tsao
Xugang Lu
ArXivPDFHTML

Papers citing "Complex spectrogram enhancement by convolutional neural network with multi-metrics learning"

50 / 55 papers shown
Title
Speech Enhancement Using Continuous Embeddings of Neural Audio Codec
Speech Enhancement Using Continuous Embeddings of Neural Audio Codec
Haoyang Li
J. Yip
Tianyu Fan
Eng Siong Chng
54
1
0
22 Feb 2025
DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning
DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning
Tom Dooney
Harsh Narola
Stefano Bromuri
R. L. Curier
C. Broeck
Sarah Caudill
D. Tan
74
0
0
30 Jan 2025
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech
  Enhancement
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
Tao Sun
Sander Bohté
28
2
0
14 Aug 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang
Niki Trigoni
Andrew Markham
42
3
0
11 Jun 2024
Towards Decoupling Frontend Enhancement and Backend Recognition in
  Monaural Robust ASR
Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
46
4
0
11 Mar 2024
CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional
  Encoding for Single- and Multi-Channel Speaker Separation
CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional Encoding for Single- and Multi-Channel Speaker Separation
Vahid Ahmadi Kalkhorani
DeLiang Wang
48
3
0
06 Mar 2024
Decoupled Spatial and Temporal Processing for Resource Efficient
  Multichannel Speech Enhancement
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
49
1
0
15 Jan 2024
Speech Separation based on Contrastive Learning and Deep Modularization
Speech Separation based on Contrastive Learning and Deep Modularization
Peter Ochieng
SSL
30
0
0
18 May 2023
Rethinking complex-valued deep neural networks for monaural speech
  enhancement
Rethinking complex-valued deep neural networks for monaural speech enhancement
Haibin Wu
Ke Tan
Buye Xu
Anurag Kumar
Daniel D. E. Wong
29
6
0
11 Jan 2023
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
40
21
0
01 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech
  Separation
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
48
121
0
22 Nov 2022
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral
  Mapping for Single-channel Speech Enhancement
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement
Kuan-Lin Chen
Daniel D. E. Wong
Ke Tan
Buye Xu
Anurag Kumar
V. Ithapu
35
1
0
16 Nov 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural
  Speaker Separation
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
74
99
0
08 Sep 2022
LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic
  Acoustic Echo Cancellation
LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic Acoustic Echo Cancellation
Chen Zhang
Jinjiang Liu
Xueliang Zhang
19
9
0
15 Aug 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative
  Models
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
24
185
0
11 Aug 2022
GLD-Net: Improving Monaural Speech Enhancement by Learning Global and
  Local Dependency Features with GLD Block
GLD-Net: Improving Monaural Speech Enhancement by Learning Global and Local Dependency Features with GLD Block
Xinmeng Xu
Yang Wang
Jie Jia
Binbin Chen
Jia Hao
17
5
0
30 Jun 2022
Resource-efficient Deep Neural Networks for Automotive Radar
  Interference Mitigation
Resource-efficient Deep Neural Networks for Automotive Radar Interference Mitigation
J. Rock
Wolfgang Roth
Máté Tóth
Paul Meissner
Franz Pernkopf
30
43
0
25 Jan 2022
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for
  Speech Enhancement Using Sign-Exponent-Only Floating-Points
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Yu-Chen Lin
Cheng Yu
Y. Hsu
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
19
6
0
08 Nov 2021
Leveraging Low-Distortion Target Estimates for Improved Speech
  Enhancement
Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
126
15
0
01 Oct 2021
Convolutive Prediction for Monaural Speech Dereverberation and
  Noisy-Reverberant Speaker Separation
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
30
31
0
16 Aug 2021
On The Compensation Between Magnitude and Phase in Speech Separation
On The Compensation Between Magnitude and Phase in Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
27
71
0
11 Aug 2021
Audio Spectral Enhancement: Leveraging Autoencoders for Low Latency
  Reconstruction of Long, Lossy Audio Sequences
Audio Spectral Enhancement: Leveraging Autoencoders for Low Latency Reconstruction of Long, Lossy Audio Sequences
Darshan Deshpande
H. Abichandani
19
0
0
08 Aug 2021
Self-attending RNN for Speech Enhancement to Improve Cross-corpus
  Generalization
Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization
Ashutosh Pandey
DeLiang Wang
17
40
0
26 May 2021
Complex-valued Convolutional Neural Networks for Enhanced Radar Signal
  Denoising and Interference Mitigation
Complex-valued Convolutional Neural Networks for Enhanced Radar Signal Denoising and Interference Mitigation
Alexander Fuchs
J. Rock
Máté Tóth
Paul Meissner
Franz Pernkopf
24
33
0
29 Apr 2021
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement
  in the Time Domain
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Kai Wang
Bengbeng He
Weiping Zhu
41
166
0
18 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal
  Convolutional Networks
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks
Ju Lin
A. Wijngaarden
Kuang-Ching Wang
M. C. Smith
27
50
0
24 Feb 2021
Dual Application of Speech Enhancement for Automatic Speech Recognition
Dual Application of Speech Enhancement for Automatic Speech Recognition
Ashutosh Pandey
Chunxi Liu
Yun Wang
Yatharth Saraf
46
37
0
07 Nov 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and
  Continuous Speech Separation
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
35
88
0
04 Oct 2020
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
Ashutosh Pandey
DeLiang Wang
31
135
0
03 Sep 2020
Improved Lite Audio-Visual Speech Enhancement
Improved Lite Audio-Visual Speech Enhancement
Shang-Yi Chuang
Hsin-Min Wang
Yu Tsao
33
32
0
30 Aug 2020
Lite Audio-Visual Speech Enhancement
Lite Audio-Visual Speech Enhancement
Shang-Yi Chuang
Yu Tsao
Chen-Chou Lo
Hsin-Min Wang
24
24
0
24 May 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network
SpEx: Multi-Scale Time Domain Speaker Extraction Network
Chenglin Xu
Wei Rao
Eng Siong Chng
Haizhou Li
31
167
0
17 Apr 2020
WaveCRN: An Efficient Convolutional Recurrent Neural Network for
  End-to-end Speech Enhancement
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement
Tsun-An Hsieh
Hsin-Min Wang
Xugang Lu
Yu Tsao
51
60
0
06 Apr 2020
A Recursive Network with Dynamic Attention for Monaural Speech
  Enhancement
A Recursive Network with Dynamic Attention for Monaural Speech Enhancement
Andong Li
C. Zheng
Cunhang Fan
Renhua Peng
Xiaodong Li
14
28
0
29 Mar 2020
A Review of Multi-Objective Deep Learning Speech Denoising Methods
A Review of Multi-Objective Deep Learning Speech Denoising Methods
A. Azarang
N. Kehtarnavaz
39
30
0
26 Mar 2020
A Time-domain Monaural Speech Enhancement with Feedback Learning
A Time-domain Monaural Speech Enhancement with Feedback Learning
Andong Li
C. Zheng
Linjuan Cheng
Renhua Peng
Xiaodong Li
23
3
0
22 Mar 2020
Multi-Microphone Complex Spectral Mapping for Speech Dereverberation
Multi-Microphone Complex Spectral Mapping for Speech Dereverberation
Zhong-Qiu Wang
DeLiang Wang
27
61
0
04 Mar 2020
On Cross-Corpus Generalization of Deep Learning Based Speech Enhancement
On Cross-Corpus Generalization of Deep Learning Based Speech Enhancement
Ashutosh Pandey
DeLiang Wang
27
52
0
10 Feb 2020
Speech Enhancement based on Denoising Autoencoder with Multi-branched
  Encoders
Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders
Cheng Yu
Ryandhimas E. Zezario
Syu-Siang Wang
Jonathan Sherman
Yi-Yen Hsieh
Xugang Lu
Hsin-Min Wang
Yu Tsao
27
38
0
06 Jan 2020
Investigating U-Nets with various Intermediate Blocks for
  Spectrogram-based Singing Voice Separation
Investigating U-Nets with various Intermediate Blocks for Spectrogram-based Singing Voice Separation
Woosung Choi
Minseok Kim
Jaehwa Chung
Daewon Lee
Soonyoung Jung
24
4
0
02 Dec 2019
Narrow-band Deep Filtering for Multichannel Speech Enhancement
Narrow-band Deep Filtering for Multichannel Speech Enhancement
Xiaofei Li
Radu Horaud
19
21
0
25 Nov 2019
Sound texture synthesis using RI spectrograms
Sound texture synthesis using RI spectrograms
Hugo Caracalla
Axel Roebel
21
7
0
21 Oct 2019
Improving the Intelligibility of Electric and Acoustic Stimulation
  Speech Using Fully Convolutional Networks Based Speech Enhancement
Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement
N. Wang
H. Wang
Tao-Wei Wang
Szu-Wei Fu
Xugan Lu
Yu Tsao
Hsin-Min Wang
28
1
0
26 Sep 2019
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully
  Convolutional Networks
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks
Changle Liu
Sze-Wei Fu
You-Jin Li
Jen-Wei Huang
Hsin-Min Wang
Yu Tsao
30
50
0
26 Sep 2019
Convolutional Recurrent Neural Network Based Progressive Learning for
  Monaural Speech Enhancement
Convolutional Recurrent Neural Network Based Progressive Learning for Monaural Speech Enhancement
Andong Li
Minmin Yuan
C. Zheng
Xiaodong Li
16
8
0
28 Aug 2019
Complex Signal Denoising and Interference Mitigation for Automotive
  Radar Using Convolutional Neural Networks
Complex Signal Denoising and Interference Mitigation for Automotive Radar Using Convolutional Neural Networks
J. Rock
Máté Tóth
Elmar Messner
Paul Meissner
Franz Pernkopf
36
44
0
24 Jun 2019
Self-supervised Audio Spatialization with Correspondence Classifier
Self-supervised Audio Spatialization with Correspondence Classifier
Yu-Ding Lu
Hsin-Ying Lee
Hung-Yu Tseng
Ming-Hsuan Yang
14
22
0
14 May 2019
Learning with Learned Loss Function: Speech Enhancement with Quality-Net
  to Improve Perceptual Evaluation of Speech Quality
Learning with Learned Loss Function: Speech Enhancement with Quality-Net to Improve Perceptual Evaluation of Speech Quality
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
16
69
0
06 May 2019
A study on speech enhancement using exponent-only floating point
  quantized neural network (EOFP-QNN)
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y. Hsu
Yu-Chen Lin
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
MQ
22
15
0
17 Aug 2018
Convolutional Neural Networks to Enhance Coded Speech
Convolutional Neural Networks to Enhance Coded Speech
Ziyue Zhao
Huijun Liu
Tim Fingscheidt
32
64
0
25 Jun 2018
12
Next