Metis: A Foundation Speech Generation Model with Masked Generative Pre-training

5 February 2025

Papers citing "Metis: A Foundation Speech Generation Model with Masked Generative Pre-training"

3 / 3 papers shown

Title
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation Puyuan Peng Shang-Wen Li Abdelrahman Mohamed David Harwath 36 0 0 26 May 2025
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement Ziling Huang Haixin Guan Yanhua Long 99 0 0 18 May 2025
Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment Xueyao Zhang Yijiao Wang Chaoren Wang Zehan Li Zhuo Chen Zhizheng Wu 337 0 0 07 May 2025