Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.02363
Cited By
v1
v2
v3
v4 (latest)
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
3 November 2021
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features"
27 / 27 papers shown
Title
HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
Dyah A. M. G. Wisnu
Stefano Rini
Ryandhimas E. Zezario
Hsin-Min Wang
Yu Tsao
145
0
0
10 Jan 2025
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
Ryandhimas E. Zezario
Sabato Marco Siniscalchi
Hsin-Min Wang
Yu Tsao
82
3
0
16 Sep 2024
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
58
15
0
07 Apr 2022
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Ryandhimas E. Zezario
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
73
17
0
07 Apr 2022
Deep Noise Suppression Maximizing Non-Differentiable PESQ Mediated by a Non-Intrusive PESQNet
Ziyi Xu
Maximilian Strake
Tim Fingscheidt
50
15
0
06 Nov 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
80
46
0
12 Oct 2021
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Cheng-Hung Hu
Yu-Huai Peng
Junichi Yamagishi
Yu Tsao
Hsin-Min Wang
39
5
0
20 Jul 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
184
3,004
0
14 Jun 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
69
216
0
08 Apr 2021
Utilizing Self-supervised Representations for MOS Prediction
Wei-Cheng Tseng
Chien-yu Huang
Wei-Tsung Kao
Yist Y. Lin
Hung-yi Lee
SSL
103
65
0
07 Apr 2021
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
Yichong Leng
Xu Tan
Sheng Zhao
Frank Soong
Xiang-Yang Li
Tao Qin
84
96
0
27 Feb 2021
Speech Enhancement with Zero-Shot Model Selection
Ryandhimas E. Zezario
C. Fuh
Hsin-Min Wang
Yu Tsao
54
5
0
17 Dec 2020
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model
Ryandhimas E. Zezario
Szu-Wei Fu
C. Fuh
Yu Tsao
Hsin-Min Wang
39
42
0
09 Nov 2020
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
90
315
0
28 Oct 2020
A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals
Xuan Dong
Donald Williamson
46
19
0
31 Jul 2020
Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification
Yeunju Choi
Youngmoon Jung
Hoirin Kim
99
27
0
16 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
301
5,849
0
20 Jun 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi
51
125
0
14 Feb 2020
T-GSA: Transformer with Gaussian-weighted self-attention for speech enhancement
Jaeyoung Kim
Mostafa El-Khamy
Jungwon Lee
77
189
0
13 Oct 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
67
331
0
13 May 2019
MOSNet: Deep Learning based Objective Assessment for Voice Conversion
Chen-Chou Lo
Szu-Wei Fu
Wen-Chin Huang
Xin Wang
Junichi Yamagishi
Yu Tsao
H. Wang
61
275
0
17 Apr 2019
Non-intrusive speech quality assessment using neural networks
Anderson R. Avila
H. Gamper
Chandan K. A. Reddy
Ross Cutler
I. Tashev
J. Gehrke
60
110
0
16 Mar 2019
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
165
1,205
0
06 Nov 2018
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
Szu-Wei Fu
Yu Tsao
Hsin-Te Hwang
H. Wang
77
165
0
16 Aug 2018
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
193
718
0
29 Jul 2018
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks
Szu-Wei Fu
Yu Tsao
Xugang Lu
Hisashi Kawai
64
197
0
07 Mar 2017
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
1