Title |
---|
**AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?** Han Bao, Yue Huang, Yanbo Wang, Jiayi Ye, Xiangqi Wang, Xiuying Chen, Mohamed Elhoseiny, Xuzhi Zhang, Xiangliang Zhang |
**Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data** Shuhao Gu, Jialing Zhang, Siyuan Zhou, Kevin Yu, Zhaohu Xing, ... Yufeng Cui, Xinlong Wang, Yaoqi Liu, Fangxiang Feng, Guang Liu |
**LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs** Yujun Zhou, Jingdong Yang, Kehan Guo, Pin-Yu Chen, Tian Gao, ... Werner Geyer, Nuno Moniz, Nitesh V Chawla, Xiangliang Zhang |