Title | Authors |
---|---|
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs | Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, ..., Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng |
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS | Sefik Emre Eskimez, Xiaofei Wang, Manthan Thakker, Canrun Li, Chung-Hsien Tsai, ..., Min Tang, Xu Tan, Yanqing Liu, Sheng Zhao, Naoyuki Kanda |
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment | Bing Han, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Yanming Qian, Yanqing Liu, Sheng Zhao, Jinyu Li, Furu Wei |