Title |
---|
![]() YuLan: An Open-source Large Language Model Yutao Zhu Kun Zhou Kelong Mao Wentong Chen Yiding Sun ...Wenbing Huang Ze-Feng Gao Yueguo Chen Weizheng Lu Ji-Rong Wen |
![]() CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for
Foundation Models Zhong-Zhi Li Ming-Liang Zhang Fei Yin Zhi-Long Ji Jin-Feng Bai Zhen-Ru Pan Fan-Hu Zeng Jian Xu Jia-Xin Zhang Cheng-Lin Liu |
![]() LiveBench: A Challenging, Contamination-Limited LLM Benchmark Colin White Samuel Dooley Manley Roberts Arka Pal Ben Feuer ...Willie Neiswanger Micah Goldblum Tom Goldstein Willie Neiswanger Micah Goldblum |