Title |
---|
![]() xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed
Representations Can Qin Congying Xia Krithika Ramakrishnan Michael S Ryoo Lifu Tu ...Silvio Savarese Juan Carlos Niebles Zeyuan Chen Ran Xu Caiming Xiong |
![]() MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation
in Videos Xuehai He Weixi Feng Kaizhi Zheng Yujie Lu Wanrong Zhu ...Zhengyuan Yang Kevin Lin William Yang Wang Lijuan Wang Xin Eric Wang |
![]() m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt Jian Yang Hongcheng Guo Yuwei Yin Jiaqi Bai Bing Wang Jiaheng Liu Xinnian Liang Linzheng Cahi Liqun Yang Zhoujun Li |