Title |
---|
![]() Mementos: A Comprehensive Benchmark for Multimodal Large Language Model
Reasoning over Image Sequences Xiyao Wang Yuhang Zhou Xiaoyu Liu Hongjin Lu Yuancheng Xu ...Taixi Lu Gedas Bertasius Mohit Bansal Huaxiu Yao Furong Huang |
![]() DetGPT: Detect What You Need via Reasoning Renjie Pi Jiahui Gao Shizhe Diao Boyao Wang Hanze Dong ...Lewei Yao Jianhua Han Hang Xu Lingpeng Kong Tong Zhang Tong Zhang |