EquiBench: Benchmarking Large Language Models' Understanding of Program Semantics via Equivalence Checking
v1v2 (latest)

EquiBench: Benchmarking Large Language Models' Understanding of Program Semantics via Equivalence Checking

    LRM

Papers citing "EquiBench: Benchmarking Large Language Models' Understanding of Program Semantics via Equivalence Checking"

12 / 12 papers shown
Title
Qwen Technical Report
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
268
1,908
0
28 Sep 2023

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.