ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.02082
  4. Cited By
SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction

SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction

2 June 2025
Saurabh Agrawal
Raj Gohil
Gopal Kumar Agrawal
Vikram C M
Kushal Verma
ArXiv (abs)PDFHTML

Papers citing "SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction"

22 / 22 papers shown
Title
MOSPC: MOS Prediction Based on Pairwise Comparison
MOSPC: MOS Prediction Based on Pairwise Comparison
Kexin Wang
Yunlong Zhao
Qianqian Dong
Tom Ko
Mingxuan Wang
49
6
0
18 Jun 2023
Evaluation of Speech Representations for MOS prediction
Evaluation of Speech Representations for MOS prediction
F. S. Oliveira
Edresson Casanova
Arnaldo Cândido Júnior
L. Gris
A. S. Soares
A. R. G. Filho
49
4
0
16 Jun 2023
Speech Quality Assessment through MOS using Non-Matching References
Speech Quality Assessment through MOS using Non-Matching References
Pranay Manocha
Anurag Kumar
125
28
0
24 Jun 2022
Fusion of Self-supervised Learned Models for MOS Prediction
Fusion of Self-supervised Learned Models for MOS Prediction
Zhengdong Yang
Wangjin Zhou
Chenhui Chu
Sheng Li
Raj Dabre
Raphaël Rubino
Yi Zhao
58
29
0
11 Apr 2022
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training
  and Distribution of Opinion Scores
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores
Wei-Cheng Tseng
Wei-Tsung Kao
Hung-yi Lee
62
21
0
07 Apr 2022
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural
  Text-to-Speech Synthesis
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
Georgia Maniati
Alexandra Vioni
Nikolaos Ellinas
Karolos Nikitaras
Konstantinos Klapsas
June Sig Sung
Gunu Jho
Aimilios Chalamandaris
Pirros Tsiakoulis
40
28
0
06 Apr 2022
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Takaaki Saeki
Detai Xin
Wataru Nakata
Tomoki Koriyama
Shinnosuke Takamichi
Hiroshi Saruwatari
109
217
0
05 Apr 2022
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022
Wen-Chin Huang
Erica Cooper
Yu Tsao
Hsin-Min Wang
Tomoki Toda
Junichi Yamagishi
63
108
0
21 Mar 2022
InQSS: a speech intelligibility and quality assessment model using a
  multi-task learning network
InQSS: a speech intelligibility and quality assessment model using a multi-task learning network
Yu-Wen Chen
Yu Tsao
43
13
0
04 Nov 2021
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment
  Model with Cross-Domain Features
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
DiffM
62
81
0
03 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
259
1,898
0
26 Oct 2021
LDNet: Unified Listener Dependent Modeling in MOS Prediction for
  Synthetic Speech
LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech
Wen-Chin Huang
Erica Cooper
Junichi Yamagishi
Tomoki Toda
52
77
0
18 Oct 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
182
2,993
0
14 Jun 2021
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
Yichong Leng
Xu Tan
Sheng Zhao
Frank Soong
Xiang-Yang Li
Tao Qin
76
96
0
27 Feb 2021
TERA: Self-Supervised Learning of Transformer Encoder Representation for
  Speech
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
132
358
0
12 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
295
5,837
0
20 Jun 2020
MOSNet: Deep Learning based Objective Assessment for Voice Conversion
MOSNet: Deep Learning based Objective Assessment for Voice Conversion
Chen-Chou Lo
Szu-Wei Fu
Wen-Chin Huang
Xin Wang
Junichi Yamagishi
Yu Tsao
H. Wang
52
274
0
17 Apr 2019
wav2vec: Unsupervised Pre-training for Speech Recognition
wav2vec: Unsupervised Pre-training for Speech Recognition
Steffen Schneider
Alexei Baevski
R. Collobert
Michael Auli
SSL
76
418
0
11 Apr 2019
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model
  based on BLSTM
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
Szu-Wei Fu
Yu Tsao
Hsin-Te Hwang
H. Wang
62
165
0
16 Aug 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel
  and Nonparallel Methods
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods
Jaime Lorenzo-Trueba
Junichi Yamagishi
Tomoki Toda
Daisuke Saito
F. Villavicencio
Tomi Kinnunen
Zhenhua Ling
59
321
0
12 Apr 2018
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech
Brian Patton
Yannis Agiomyrgiannakis
Michael Terry
K. Wilson
Rif A. Saurous
D. Sculley
63
84
0
28 Nov 2016
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg3DV
1.9K
77,378
0
18 May 2015
1