Title |
---|
![]() Sound Check: Auditing Audio Datasets William Agnew Julia Barnett Annie Chu Rachel Hong Michael Feffer Robin Netzorg Harry H. Jiang Ezra Awumey Sauvik Das |
![]() MIO: A Foundation Model on Multimodal Tokens Zekun Wang King Zhu Chunpu Xu Wangchunshu Zhou Jiaheng Liu ...Yuanxing Zhang Ge Zhang Ke Xu Jie Fu Wenhao Huang |
![]() Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec
models Haibin Wu Xuanjun Chen Yi-Cheng Lin Kaiwei Chang Jiawei Du ...Yi-Chiao Wu Xu Tan James Glass Shinji Watanabe Hung-yi Lee |
![]() Investigating Neural Audio Codecs for Speech Language Model-Based Speech
Generation Jiaqi Li Dongmei Wang Xiaofei Wang Yao Qian Long Zhou ...Junkun Chen Sheng Zhao Jinyu Li Zhizheng Wu Michael Zeng |
![]() SSDM: Scalable Speech Dysfluency Modeling Jiachen Lian Xuanru Zhou Z. Ezzes Jet M J Vonk Brittany Morin D. Baquirin Zachary Mille M. G. Tempini Gopala Anumanchipalli |