Title |
---|
![]() Emu3: Next-Token Prediction is All You Need Xinlong Wang Xiaosong Zhang Zhengxiong Luo Quan-Sen Sun Yufeng Cui ...Xi Yang Jingjing Liu Yonghua Lin Tiejun Huang Zhongyuan Wang |
![]() Show-o: One Single Transformer to Unify Multimodal Understanding and
Generation Jinheng Xie Weijia Mao Zechen Bai David Junhao Zhang Weihao Wang Kevin Qinghong Lin Yuchao Gu Zhijie Chen Zhenheng Yang Mike Zheng Shou |
![]() PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image
Models Fanqing Meng Wenqi Shao Lixin Luo Yahong Wang Yiran Chen ...Yue Yang Tianshuo Yang Kaipeng Zhang Yu Qiao Ping Luo |