ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.12466
  4. Cited By
EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking

EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking

18 February 2025
Anjiang Wei
Jiannan Cao
Ran Li
H. Chen
Y. Zhang
Ziheng Wang
Yaofeng Sun
Yuan Liu
Thiago S. F. X. Teixeira
D. Yang
Ke Wang
Alex Aiken
    LRM
ArXivPDFHTML

Papers citing "EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking"

1 / 1 papers shown
Title
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis
Anjiang Wei
Tarun Suresh
Jiannan Cao
Naveen Kannan
Yuheng Wu
Kai Yan
Thiago S. F. X. Teixeira
Ke Wang
Alex Aiken
ELM
LRM
43
0
0
29 Mar 2025
1