ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.02363
  4. Cited By
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment
  Model with Cross-Domain Features
v1v2v3v4 (latest)

Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features

3 November 2021
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features"

27 / 27 papers shown
Title
HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
Dyah A. M. G. Wisnu
Stefano Rini
Ryandhimas E. Zezario
Hsin-Min Wang
Yu Tsao
145
0
0
10 Jan 2025
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
Ryandhimas E. Zezario
Sabato Marco Siniscalchi
Hsin-Min Wang
Yu Tsao
82
3
0
16 Sep 2024
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
58
15
0
07 Apr 2022
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility
  Prediction Model for Hearing Aids
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Ryandhimas E. Zezario
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
73
17
0
07 Apr 2022
Deep Noise Suppression Maximizing Non-Differentiable PESQ Mediated by a
  Non-Intrusive PESQNet
Deep Noise Suppression Maximizing Non-Differentiable PESQ Mediated by a Non-Intrusive PESQNet
Ziyi Xu
Maximilian Strake
Tim Fingscheidt
50
15
0
06 Nov 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only
  on noisy/ reverberated speech
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
80
46
0
12 Oct 2021
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Cheng-Hung Hu
Yu-Huai Peng
Junichi Yamagishi
Yu Tsao
Hsin-Min Wang
39
5
0
20 Jul 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
184
3,004
0
14 Jun 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
69
216
0
08 Apr 2021
Utilizing Self-supervised Representations for MOS Prediction
Utilizing Self-supervised Representations for MOS Prediction
Wei-Cheng Tseng
Chien-yu Huang
Wei-Tsung Kao
Yist Y. Lin
Hung-yi Lee
SSL
103
65
0
07 Apr 2021
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
Yichong Leng
Xu Tan
Sheng Zhao
Frank Soong
Xiang-Yang Li
Tao Qin
84
96
0
27 Feb 2021
Speech Enhancement with Zero-Shot Model Selection
Speech Enhancement with Zero-Shot Model Selection
Ryandhimas E. Zezario
C. Fuh
Hsin-Min Wang
Yu Tsao
54
5
0
17 Dec 2020
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility
  Assessment Model
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model
Ryandhimas E. Zezario
Szu-Wei Fu
C. Fuh
Yu Tsao
Hsin-Min Wang
39
42
0
09 Nov 2020
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to
  evaluate Noise Suppressors
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
90
315
0
28 Oct 2020
A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality
  Ratings of Real-World Signals
A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals
Xuan Dong
Donald Williamson
46
19
0
31 Jul 2020
Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning
  With Spoofing Detection and Spoofing Type Classification
Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification
Yeunju Choi
Youngmoon Jung
Hoirin Kim
99
27
0
16 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
301
5,849
0
20 Jun 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi
51
125
0
14 Feb 2020
T-GSA: Transformer with Gaussian-weighted self-attention for speech
  enhancement
T-GSA: Transformer with Gaussian-weighted self-attention for speech enhancement
Jaeyoung Kim
Mostafa El-Khamy
Jungwon Lee
77
189
0
13 Oct 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores
  Optimization for Speech Enhancement
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
67
331
0
13 May 2019
MOSNet: Deep Learning based Objective Assessment for Voice Conversion
MOSNet: Deep Learning based Objective Assessment for Voice Conversion
Chen-Chou Lo
Szu-Wei Fu
Wen-Chin Huang
Xin Wang
Junichi Yamagishi
Yu Tsao
H. Wang
61
275
0
17 Apr 2019
Non-intrusive speech quality assessment using neural networks
Non-intrusive speech quality assessment using neural networks
Anderson R. Avila
H. Gamper
Chandan K. A. Reddy
Ross Cutler
I. Tashev
J. Gehrke
60
110
0
16 Mar 2019
SDR - half-baked or well done?
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
165
1,205
0
06 Nov 2018
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model
  based on BLSTM
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
Szu-Wei Fu
Yu Tsao
Hsin-Te Hwang
H. Wang
77
165
0
16 Aug 2018
Speaker Recognition from Raw Waveform with SincNet
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
193
718
0
29 Jul 2018
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks
Szu-Wei Fu
Yu Tsao
Xugang Lu
Hisashi Kawai
66
197
0
07 Mar 2017
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
1