ResearchTrend.AI
Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals


20 May 2025
Xianzhen Luo
Qingfu Zhu
Zhiming Zhang
Mingzheng Xu
Tianhao Cheng
Yixuan Wang
Zheng Chu
Shijie Xuyang
Zhiyuan Ma
YuanTao Fan
Wanxiang Che
    AAML

Papers citing "Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals"

11 / 11 papers shown
RobuNFR: Evaluating the Robustness of Large Language Models on Non-Functional Requirements Aware Code Generation
Feng Lin
Dong Jae Kim
Zhiyu Li
Jinqiu Yang
Tse-Hsun Chen
AAML
99
1
0
28 Mar 2025
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Tianyu Zheng
Ge Zhang
Tianhao Shen
Xueling Liu
Bill Yuchen Lin
Jie Fu
Wenhu Chen
Xiang Yue
SyDa
154
129
0
08 Jan 2025
A Survey on Natural Language Counterfactual Generation
Yongjie Wang
Xiaoqi Qiu
Yu Yue
Xu Guo
Zhiwei Zeng
Yuhong Feng
Zhiqi Shen
59
9
0
04 Jul 2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Terry Yue Zhuo
Minh Chien Vu
Jenny Chim
Han Hu
Wenhao Yu
...
David Lo
Daniel Fried
Xiaoning Du
H. D. Vries
Leandro von Werra
172
192
0
22 Jun 2024
The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?
Alex Gu
Wen-Ding Li
Naman Jain
Theo X. Olausson
Celine Lee
Koushik Sen
Armando Solar-Lezama
LRM
50
19
0
29 Feb 2024
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Xueyu Hu
Ziyu Zhao
Shuang Wei
Ziwei Chai
Qianli Ma
...
Jiwei Li
Kun Kuang
Yang Yang
Hongxia Yang
Leilei Gan
LMTD, ELM
63
57
0
10 Jan 2024
WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning
Zhaojian Yu
Xin Zhang
Ning Shang
Yangyu Huang
Can Xu
Yishujie Zhao
Wenxiang Hu
Qiufeng Yin
SyDa
90
28
0
20 Dec 2023
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Ziyang Luo
Can Xu
Pu Zhao
Qingfeng Sun
Xiubo Geng
Wenxiang Hu
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
ELM, SyDa, ALM
119
690
0
14 Jun 2023
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
54
3
0
12 Dec 2022
Counterfactual Explanations for Models of Code
Jürgen Cito
Işıl Dillig
V. Murali
S. Chandra
AAML, LRM
68
50
0
10 Nov 2021
Program Synthesis with Large Language Models
Jacob Austin
Augustus Odena
Maxwell Nye
Maarten Bosma
Henryk Michalewski
...
Ellen Jiang
Carrie J. Cai
Michael Terry
Quoc V. Le
Charles Sutton
ELM, AIMat, ReCod, ALM
216
2,004
0
16 Aug 2021