
R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation
Meng-Hao Guo
Jiajun Xu
Yi Zhang
Jiaxi Song
Haoyang Peng
Yi-Xuan Deng
Xinzhi Dong
Kiyohiro Nakayama
Zhengyang Geng
Chen Wang
Bolin Ni
Guo-Wei Yang
Yongming Rao
Houwen Peng
Han Hu
Gordon Wetzstein
Shi-Min Hu
Papers citing "R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation"
13 / 13 papers shown
Title |
---|
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models. Bofei Gao, Feifan Song, Zhiyong Yang, Zefan Cai, Yibo Miao, ..., Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang |
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models. Haodong Duan, Junming Yang, Xinyu Fang, Lin Chen, ..., Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen |