Title |
---|
![]() CLEVER: A Curated Benchmark for Formally Verified Code Generation Amitayush Thakur Jasper Lee George Tsoukalas Meghana Sistla Matthew Zhao Stefan Zetzsche Greg Durrett Yisong Yue Swarat Chaudhuri |
![]() Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large
Language Models Bofei Gao Feifan Song Zhengyuan Yang Zefan Cai Yibo Miao ...Lei Sha Yichang Zhang Xuancheng Ren Tianyu Liu Baobao Chang |