Title |
---|
![]() A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive
Transformer for Efficient Finegrained Image Generation Liang Chen Sinan Tan Zefan Cai Weichu Xie Haozhe Zhao Yichi Zhang Junyang Lin Jinze Bai Tianyu Liu Baobao Chang |
![]() PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Weifeng Lin Xinyu Wei Renrui Zhang Le Zhuo Shitian Zhao ...Junlin Xie Junlin Xie Yu Qiao Peng Gao Hongsheng Li |
![]() In-Context Imitation Learning via Next-Token Prediction Letian Fu Huang Huang Gaurav Datta Lawrence Yunliang Chen William Chung-Ho Panitch Fangchen Liu Hui Li Ken Goldberg |
![]() HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution
Sequential Tokenization Yucheng Tang Yufan He Vishwesh Nath Pengfeig Guo Ruining Deng ...Ziyue Xu Holger Roth Daguang Xu Haichun Yang Yuankai Huo |