
A Survey on Evaluation of Multimodal Large Language Models
Jiaxing Huang
Jingyi Zhang
Papers citing "A Survey on Evaluation of Multimodal Large Language Models"
50 / 50 papers shown
Title |
---|
![]() MuirBench: A Comprehensive Benchmark for Robust Multi-image
Understanding Fei Wang Xingyu Fu James Y. Huang Zekun Li Qin Liu ...Kai-Wei Chang Dan Roth Sheng Zhang Hoifung Poon Muhao Chen |
![]() AIR-Bench: Benchmarking Large Audio-Language Models via Generative
Comprehension Qian Yang Jin Xu Wenrui Liu Yunfei Chu Ziyue Jiang ...Yichong Leng Yuanjun Lv Zhou Zhao Chang Zhou Jingren Zhou |
![]() Mementos: A Comprehensive Benchmark for Multimodal Large Language Model
Reasoning over Image Sequences Xiyao Wang Yuhang Zhou Xiaoyu Liu Hongjin Lu Yuancheng Xu ...Taixi Lu Gedas Bertasius Mohit Bansal Huaxiu Yao Furong Huang |