ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.12645
  4. Cited By
Comparison of Speech Representations for Automatic Quality Estimation in
  Multi-Speaker Text-to-Speech Synthesis

Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis

28 February 2020
Jennifer Williams
Joanna Rownicka
P. Oplustil
Simon King
ArXivPDFHTML

Papers citing "Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis"

12 / 12 papers shown
Title
Partial Rank Similarity Minimization Method for Quality MOS Prediction
  of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Hemant Yadav
Erica Cooper
Junichi Yamagishi
Sunayana Sitaram
R. Shah
11
0
0
08 Oct 2023
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction
  in Text-to-Speech for Low-Resource Languages
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource Languages
P. Do
Matt Coler
J. Dijkstra
E. Klabbers
32
3
0
30 May 2023
Automatic Evaluation of Turn-taking Cues in Conversational Speech
  Synthesis
Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis
Erik Ekstedt
Siyang Wang
Éva Székely
Joakim Gustafson
Gabriel Skantze
28
6
0
29 May 2023
SQuId: Measuring Speech Naturalness in Many Languages
SQuId: Measuring Speech Naturalness in Many Languages
Thibault Sellam
Ankur Bapna
Joshua Camp
Diana Mackinnon
Ankur P. Parikh
Jason Riesa
35
17
0
12 Oct 2022
Predicting pairwise preferences between TTS audio stimuli using parallel
  ratings data and anti-symmetric twin neural networks
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Cassia Valentini-Botinhao
M. Ribeiro
O. Watts
Korin Richmond
G. Henter
16
1
0
22 Sep 2022
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural
  Text-to-Speech Synthesis
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
Georgia Maniati
Alexandra Vioni
Nikolaos Ellinas
Karolos Nikitaras
Konstantinos Klapsas
June Sig Sung
Gunu Jho
Aimilios Chalamandaris
Pirros Tsiakoulis
19
26
0
06 Apr 2022
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022
Wen-Chin Huang
Erica Cooper
Yu Tsao
Hsin-Min Wang
T. Toda
Junichi Yamagishi
11
102
0
21 Mar 2022
Human Perception of Audio Deepfakes
Human Perception of Audio Deepfakes
Nicolas M. Muller
Karla Markert
Konstantin Böttinger
22
49
0
20 Jul 2021
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
Yichong Leng
Xu Tan
Sheng Zhao
Frank Soong
Xiang-Yang Li
Tao Qin
24
96
0
27 Feb 2021
Learning Disentangled Phone and Speaker Representations in a
  Semi-Supervised VQ-VAE Paradigm
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm
Jennifer Williams
Yi Zhao
Erica Cooper
Junichi Yamagishi
SSL
25
23
0
21 Oct 2020
Predictions of Subjective Ratings and Spoofing Assessments of Voice
  Conversion Challenge 2020 Submissions
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions
Rohan Kumar Das
Tomi Kinnunen
Wen-Chin Huang
Zhenhua Ling
Junichi Yamagishi
Yi Zhao
Xiaohai Tian
T. Toda
31
52
0
08 Sep 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
41
318
0
09 Aug 2020
1