Title |
---|
![]() VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via
Monotonic Alignment Bing Han Long Zhou Shujie Liu Sanyuan Chen Lingwei Meng Yanming Qian Yanqing Liu Sheng Zhao Jinyu Li Furu Wei |
![]() Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with
Multi-Scale Acoustic Prompts Shunwei Lei Yixuan Zhou Liyang Chen Dan Luo Zhiyong Wu ...Shiyin Kang Tao Jiang Yahui Zhou Yuxing Han Helen M. Meng |