Title |
---|
![]() E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS Sefik Emre Eskimez Xiaofei Wang Manthan Thakker Canrun Li Chung-Hsien Tsai ...Min Tang Xu Tan Yanqing Liu Sheng Zhao Naoyuki Kanda |
![]() Joint Speaker Features Learning for Audio-visual Multichannel Speech
Separation and Recognition Guinan Li Jiajun Deng Youjun Chen Mengzhe Geng Shujie Hu ...Zengrui Jin Tianzi Wang Xurong Xie Helen Meng Xunying Liu |
![]() On the Evaluation of Speech Foundation Models for Spoken Language
Understanding Siddhant Arora Ankita Pasad Chung-Ming Chien Jionghao Han Roshan S. Sharma ...William Chen Suwon Shon Hung-yi Lee Karen Livescu Shinji Watanabe |
![]() Towards Effective and Efficient Non-autoregressive Decoding Using
Block-based Attention Mask Tianzi Wang Xurong Xie Zhaoqing Li Shoukang Hu Zengrui Jin ...Shujie Hu Mengzhe Geng Guinan Li Helen Meng Xunying Liu |