End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks

12 September 2017
Szu-Wei Fu
Tao-Wei Wang
Yu Tsao
Xugang Lu
Hisashi Kawai
arXiv:1709.03658

Papers citing "End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks"

49 / 99 papers shown
Title
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition
Xiang Hao
Xiangdong Su
Zhiyu Wang
Hui Zhang
Batushiren
14
32
0
29 Oct 2020
Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement
Tsun-An Hsieh
Cheng Yu
Szu-Wei Fu
Xugang Lu
Yu Tsao
20
16
0
28 Oct 2020
Investigating Cross-Domain Losses for Speech Enhancement
Sherif Abdulatif
Karim Armanious
Jayasankar T. Sajeev
Karim Guirguis
B. Yang
19
7
0
20 Oct 2020
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
Ashutosh Pandey
DeLiang Wang
13
134
0
03 Sep 2020
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application
Yu-Wen Chen
Kuo-Hsuan Hung
You-Jin Li
A. Kang
Ya-Hsin Lai
Kai-Chun Liu
Szu-Wei Fu
Syu-Siang Wang
Yu Tsao
25
5
0
21 Aug 2020
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
Yanxin Hu
Yun Liu
Shubo Lv
Mengtao Xing
Shimin Zhang
Yihui Fu
Jian Wu
Bihong Zhang
Lei Xie
6
585
0
01 Aug 2020
A Unified Framework of Surrogate Loss by Refactoring and Interpolation
Lanlan Liu
Mingzhe Wang
Jia Deng
22
8
0
27 Jul 2020
Lite Audio-Visual Speech Enhancement
Shang-Yi Chuang
Yu Tsao
Chen-Chou Lo
Hsin-Min Wang
16
24
0
24 May 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network
Chenglin Xu
Wei Rao
E. Chng
Haizhou Li
15
166
0
17 Apr 2020
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement
Tsun-An Hsieh
Hsin-Min Wang
Xugang Lu
Yu Tsao
43
60
0
06 Apr 2020
A Time-domain Monaural Speech Enhancement with Feedback Learning
Andong Li
C. Zheng
Linjuan Cheng
Renhua Peng
Xiaodong Li
15
3
0
22 Mar 2020
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
21
97
0
09 Mar 2020
Phonetic Feedback for Speech Enhancement With and Without Parallel Speech Data
Peter William VanHarn Plantinga
Deblin Bagchi
Eric Fosler-Lussier
21
4
0
03 Mar 2020
Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders
Cheng Yu
Ryandhimas E. Zezario
Syu-Siang Wang
Jonathan Sherman
Yi-Yen Hsieh
Xugang Lu
Hsin-Min Wang
Yu Tsao
19
38
0
06 Jan 2020
A Unified Framework for Speech Separation
F. Bahmaninezhad
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
John H. L. Hansen
Dong Yu
9
4
0
17 Dec 2019
A Supervised Speech enhancement Approach with Residual Noise Control for Voice Communication
Andong Li
C. Zheng
Xiaodong Li
25
8
0
08 Dec 2019
Time-Domain Multi-modal Bone/air Conducted Speech Enhancement
Cheng Yu
Kuo-Hsuan Hung
Syu-Siang Wang
Szu-Wei Fu
Yu Tsao
J. Hung
21
33
0
22 Nov 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
26
753
0
14 Oct 2019
A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation
Rung-Yu Tseng
Tao-Wei Wang
Szu-Wei Fu
Chia-Ying Lee
Yu Tsao
16
0
0
26 Sep 2019
Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement
N. Wang
H. Wang
Tao-Wei Wang
Szu-Wei Fu
Xugang Lu
Yu Tsao
Hsin-Min Wang
11
1
0
26 Sep 2019
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks
Changle Liu
Sze-Wei Fu
You-Jin Li
Jen-Wei Huang
Hsin-Min Wang
Yu Tsao
22
50
0
26 Sep 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
63
125
0
03 Sep 2019
Coarse-to-fine Optimization for Speech Enhancement
Jian Yao
Ahmad Al-Dahle
11
25
0
21 Aug 2019
A Dual-Staged Context Aggregation Method Towards Efficient End-To-End Speech Enhancement
Kai Zhen
Mi Suk Lee
Minje Kim
26
3
0
18 Aug 2019
Components Loss for Neural Networks in Mask-Based Speech Enhancement
Ziyi Xu
Samy Elshamy
Ziyue Zhao
Tim Fingscheidt
17
20
0
14 Aug 2019
Dilated FCN: Listening Longer to Hear Better
Shuyu Gong
Zhewei Wang
Tao Sun
Yuanhang Zhang
Charles D. Smith
Li Xu
Jundong Liu
14
14
0
27 Jul 2019
Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques
Jyun-Yi Wu
Cheng Yu
Szu-Wei Fu
Chih-Ting Liu
Shao-Yi Chien
Yu Tsao
9
23
0
31 May 2019
A Perceptual Weighting Filter Loss for DNN Training in Speech Enhancement
Ziyue Zhao
Samy Elshamy
Tim Fingscheidt
11
15
0
23 May 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
19
323
0
13 May 2019
Learning with Learned Loss Function: Speech Enhancement with Quality-Net to Improve Perceptual Evaluation of Speech Quality
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
8
69
0
06 May 2019
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
24
586
0
30 Apr 2019
Improving Deep Speech Denoising by Noisy2Noisy Signal Mapping
N. Alamdari
A. Azarang
N. Kehtarnavaz
22
42
0
26 Apr 2019
Boundary-Preserved Deep Denoising of the Stochastic Resonance Enhanced Multiphoton Images
Sheng-Yong Niu
Lun-Zhang Guo
Yue Li
Tzung-Dau Wang
Yu Tsao
Tzu-Ming Liu
17
2
0
12 Apr 2019
Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
N. Harada
13
13
0
21 Mar 2019
End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Xingjian Du
Mengyao Zhu
Xuan Shi
Xinpeng Zhang
Wen Zhang
Jingdong Chen
16
6
0
02 Jan 2019
Acoustics-guided evaluation (AGE): a new measure for estimating performance of speech enhancement algorithms for robust ASR
Li Chai
Jun Du
Chin-Hui Lee
16
7
0
28 Nov 2018
End-to-end Networks for Supervised Single-channel Speech Separation
Shrikant Venkataramani
Paris Smaragdis
33
10
0
05 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
42
1,748
0
20 Sep 2018
Single-Microphone Speech Enhancement and Separation Using Deep Learning
Morten Kolbæk
12
7
0
31 Aug 2018
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y. Hsu
Yu-Chen Lin
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
MQ
22
15
0
17 Aug 2018
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
Szu-Wei Fu
Yu Tsao
Hsin-Te Hwang
H. Wang
12
162
0
16 Aug 2018
Convolutional Neural Networks to Enhance Coded Speech
Ziyue Zhao
Huijun Liu
Tim Fingscheidt
29
64
0
25 Jun 2018
On the Relationship Between Short-Time Objective Intelligibility and Short-Time Spectral-Amplitude Mean-Square Error for Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
Jesper Jensen
12
16
0
21 Jun 2018
Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation
Hiroaki Nakajima
Yu Takahashi
Kazunobu Kondo
Yuji Hisaminato
6
5
0
15 Jun 2018
Performance Based Cost Functions for End-to-End Speech Separation
Shrikant Venkataramani
Ryley Higa
Paris Smaragdis
6
21
0
01 Jun 2018
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Zhong-Qiu Wang
Jonathan Le Roux
DeLiang Wang
J. Hershey
96
123
0
26 Apr 2018
Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders
Emad M. Grais
D. Ward
Mark D. Plumbley
9
40
0
02 Mar 2018
Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation
Emad M. Grais
H. Wierstorf
D. Ward
Mark D. Plumbley
29
21
0
28 Oct 2017
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Jen-Cheng Hou
Syu-Siang Wang
Ying-Hui Lai
Yu Tsao
Hsiu-Wen Chang
H. Wang
28
196
0
01 Sep 2017