Title |
---|
![]() xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed
Representations Can Qin Congying Xia Krithika Ramakrishnan Michael S Ryoo Lifu Tu ...Silvio Savarese Juan Carlos Niebles Zeyuan Chen Ran Xu Caiming Xiong |
![]() Show-o: One Single Transformer to Unify Multimodal Understanding and
Generation Jinheng Xie Weijia Mao Zechen Bai David Junhao Zhang Weihao Wang Kevin Qinghong Lin Yuchao Gu Zhijie Chen Zhenheng Yang Mike Zheng Shou |