Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.03128
Cited By
Metis: A Foundation Speech Generation Model with Masked Generative Pre-training
5 February 2025
Yansen Wang
Jiachen Zheng
Junan Zhang
Xueyao Zhang
Huan Liao
Zhizheng Wu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Metis: A Foundation Speech Generation Model with Masked Generative Pre-training"
3 / 3 papers shown
Title
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation
Puyuan Peng
Shang-Wen Li
Abdelrahman Mohamed
David Harwath
36
0
0
26 May 2025
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Ziling Huang
Haixin Guan
Yanhua Long
99
0
0
18 May 2025
Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment
Xueyao Zhang
Yijiao Wang
Chaoren Wang
Zehan Li
Zhuo Chen
Zhizheng Wu
337
0
0
07 May 2025
1