ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.16614
  4. Cited By
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

23 February 2025
Alexander Zhang
Marcus Dong
Jing Liu
Wei Zhang
Yejie Wang
Jian Yang
Ge Zhang
Tianming Liu
Zhongyuan Peng
Yingshui Tan
Yuyao Zhang
Zhaoxiang Wang
Weixun Wang
Yancheng He
K. Deng
Wangchunshu Zhou
Wenhao Huang
Zhenru Zhang
    LRM
ArXivPDFHTML

Papers citing "CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models"

1 / 1 papers shown
Title
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
Yancheng He
Shilong Li
Jing Liu
Weixun Wang
Xingyuan Bu
...
Zhongyuan Peng
Zhenru Zhang
Zhicheng Zheng
Wenbo Su
Bo Zheng
ELM
LRM
86
9
0
26 Feb 2025
1