Title | Authors |
---|---|
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation | Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, ..., Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu |
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS | Sefik Emre Eskimez, Xiaofei Wang, Manthan Thakker, Canrun Li, Chung-Hsien Tsai, ..., Min Tang, Xu Tan, Yanqing Liu, Sheng Zhao, Naoyuki Kanda |
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment | Bing Han, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Yanming Qian, Yanqing Liu, Sheng Zhao, Jinyu Li, Furu Wei |
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec | Shengpeng Ji, Jia-li Zuo, Minghui Fang, Siqi Zheng, Qian Chen, ..., Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao |