Title |
---|
![]() Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue Guan-Ting Lin Prashanth Gurunath Shivakumar Ankur Gandhe Chao-Han Huck Yang Yile Gu Shalini Ghosh A. Stolcke Hung-yi Lee I. Bulyko |
![]() SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross
Attention Junjie Li Yiwei Guo Xie Chen Kai Yu |
![]() Low-latency Speech Enhancement via Speech Token Generation Huaying Xue Xiulian Peng Yan Lu |
![]() LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Zhihao Du Jiaming Wang Qian Chen Yunfei Chu Zhifu Gao ...Wen Wang Siqi Zheng Chang Zhou Zhijie Yan Shiliang Zhang |
![]() UniAudio: An Audio Foundation Model Toward Universal Audio Generation Dongchao Yang Jinchuan Tian Xuejiao Tan Rongjie Huang Songxiang Liu ...Jiang Bian Xixin Wu Zhou Zhao Shinji Watanabe Helen M. Meng |