Title |
---|
![]() RenderWorld: World Model with Self-Supervised 3D Label Ziyang Yan Wenzhen Dong Yihua Shao Yuhang Lu Liu Haiyang ...Haozhe Wang Zhe Wang Yan Wang Fabio Remondino Yuexin Ma |
![]() Investigating Neural Audio Codecs for Speech Language Model-Based Speech
Generation Jiaqi Li Dongmei Wang Xiaofei Wang Yao Qian Long Zhou ...Junkun Chen Sheng Zhao Jinyu Li Zhizheng Wu Michael Zeng |
![]() NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech
Processing Tasks He Huang Taejin Park Kunal Dhawan Ivan Medennikov Krishna Puvvada Nithin Rao Koluguri Weiqing Wang Jagadeesh Balam Boris Ginsburg |