Comparison of Speech Representations for Automatic Quality Estimation in
Multi-Speaker Text-to-Speech Synthesis

Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis

28 February 2020

Jennifer Williams

Joanna Rownicka

Papers citing "Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis"

12 / 12 papers shown

Title
Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting Hemant Yadav Erica Cooper Junichi Yamagishi Sunayana Sitaram R. Shah 11 0 0 08 Oct 2023
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource Languages P. Do Matt Coler J. Dijkstra E. Klabbers 32 3 0 30 May 2023
Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis Erik Ekstedt Siyang Wang Éva Székely Joakim Gustafson Gabriel Skantze 28 6 0 29 May 2023
SQuId: Measuring Speech Naturalness in Many Languages Thibault Sellam Ankur Bapna Joshua Camp Diana Mackinnon Ankur P. Parikh Jason Riesa 35 17 0 12 Oct 2022
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks Cassia Valentini-Botinhao M. Ribeiro O. Watts Korin Richmond G. Henter 16 1 0 22 Sep 2022
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis Georgia Maniati Alexandra Vioni Nikolaos Ellinas Karolos Nikitaras Konstantinos Klapsas June Sig Sung Gunu Jho Aimilios Chalamandaris Pirros Tsiakoulis 19 26 0 06 Apr 2022
The VoiceMOS Challenge 2022 Wen-Chin Huang Erica Cooper Yu Tsao Hsin-Min Wang T. Toda Junichi Yamagishi 11 102 0 21 Mar 2022
Human Perception of Audio Deepfakes Nicolas M. Muller Karla Markert Konstantin Böttinger 22 49 0 20 Jul 2021
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network Yichong Leng Xu Tan Sheng Zhao Frank Soong Xiang-Yang Li Tao Qin 24 96 0 27 Feb 2021
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm Jennifer Williams Yi Zhao Erica Cooper Junichi Yamagishi SSL 25 23 0 21 Oct 2020
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions Rohan Kumar Das Tomi Kinnunen Wen-Chin Huang Zhenhua Ling Junichi Yamagishi Yi Zhao Xiaohai Tian T. Toda 31 52 0 08 Sep 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning Berrak Sisman Junichi Yamagishi Simon King Haizhou Li BDL 41 318 0 09 Aug 2020