Title |
---|
![]() Show-o: One Single Transformer to Unify Multimodal Understanding and
Generation Jinheng Xie Weijia Mao Zechen Bai David Junhao Zhang Weihao Wang Kevin Qinghong Lin Yuchao Gu Zhijie Chen Zhenheng Yang Mike Zheng Shou |
![]() Pandora: Towards General World Model with Natural Language Actions and
Video States Jiannan Xiang Guangyi Liu Yi Gu Qiyue Gao Yuting Ning ...Shibo Hao Yemin Shi Zhengzhong Liu Eric P. Xing Zhiting Hu |
![]() Muse: Text-To-Image Generation via Masked Generative Transformers Huiwen Chang Han Zhang Jarred Barber AJ Maschinot José Lezama ...Kevin Patrick Murphy William T. Freeman Michael Rubinstein Yuanzhen Li Dilip Krishnan |