RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation

13 February 2025
Changzhi Zhou
Xinyu Zhang
Dandan Song
Xiancai Chen
Wanli Gu
Huipeng Ma
Yuhang Tian
Mengdi Zhang
Linmei Hu
Abstract

Code generation has attracted increasing attention with the rise of Large Language Models (LLMs). Many studies have developed powerful code LLMs by synthesizing code-related instruction data and applying supervised fine-tuning. However, these methods are limited by teacher model distillation and ignore the potential of iterative refinement by self-generated code. In this paper, we propose Adaptive Critique Refinement (ACR), which enables the model to refine itself by self-generated code and external critique, rather than directly imitating the code responses of the teacher model. Concretely, ACR includes a composite scoring system with LLM-as-a-Judge to evaluate the quality of code responses and a selective critique strategy with LLM-as-a-Critic to critique self-generated low-quality code responses. We develop the RefineCoder series by iteratively applying ACR, achieving continuous performance improvement on multiple code generation benchmarks. Compared to the baselines of the same size, our proposed RefineCoder series can achieve comparable or even superior performance using less data.
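
As a rough illustration of the selective critique step described in the abstract, the sketch below runs one data-construction round. It is a minimal sketch under stated assumptions, not the authors' implementation: the names `Sample` and `acr_round`, the callables `generate`, `judge`, and `critic`, and the single `threshold` cutoff are all hypothetical, and the abstract does not say how the composite score is computed or how "low-quality" is decided.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Sample:
    instruction: str
    response: str                   # self-generated code response
    critique: Optional[str] = None  # filled in only for low-quality responses


def acr_round(
    instructions: List[str],
    generate: Callable[[str], str],      # hypothetical: current model, instruction -> code
    judge: Callable[[str, str], float],  # hypothetical: LLM-as-a-Judge quality score
    critic: Callable[[str, str], str],   # hypothetical: LLM-as-a-Critic feedback
    threshold: float,                    # assumed cutoff separating low from high quality
) -> List[Sample]:
    """Build one round of refinement data from self-generated code."""
    samples: List[Sample] = []
    for instruction in instructions:
        code = generate(instruction)
        score = judge(instruction, code)
        if score < threshold:
            # Selective critique: only low-scoring responses are sent to the critic.
            samples.append(Sample(instruction, code, critic(instruction, code)))
        else:
            # Responses judged good enough are kept without a critique.
            samples.append(Sample(instruction, code))
    return samples
```

Applying ACR iteratively, as the abstract describes, would correspond to fine-tuning on the returned samples and calling `acr_round` again with the updated model as `generate`; that outer loop is omitted here because the abstract gives no details of the training step.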

@article{zhou2025_2502.09183,
  title={RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation},
  author={Changzhi Zhou and Xinyu Zhang and Dandan Song and Xiancai Chen and Wanli Gu and Huipeng Ma and Yuhang Tian and Mengdi Zhang and Linmei Hu},
  journal={arXiv preprint arXiv:2502.09183},
  year={2025}
}