Title |
---|
![]() ControlAR: Controllable Image Generation with Autoregressive Models Zongming Li Tianheng Cheng Shoufa Chen Peize Sun Haocheng Shen Longjin Ran Xiaoxin Chen Wenyu Liu Xinggang Wang |
![]() Emu3: Next-Token Prediction is All You Need Xinlong Wang Xiaosong Zhang Zhengxiong Luo Quan-Sen Sun Yufeng Cui ...Xi Yang Jingjing Liu Yonghua Lin Tiejun Huang Zhongyuan Wang |
![]() Show-o: One Single Transformer to Unify Multimodal Understanding and
Generation Jinheng Xie Weijia Mao Zechen Bai David Junhao Zhang Weihao Wang Kevin Qinghong Lin Yuchao Gu Zhijie Chen Zhenheng Yang Mike Zheng Shou |
![]() Pandora: Towards General World Model with Natural Language Actions and
Video States Jiannan Xiang Guangyi Liu Yi Gu Qiyue Gao Yuting Ning ...Shibo Hao Yemin Shi Zhengzhong Liu Eric P. Xing Zhiting Hu |