Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.14597
Cited By
Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals
20 May 2025
Xianzhen Luo
Qingfu Zhu
Zhiming Zhang
Mingzheng Xu
Tianhao Cheng
Yixuan Wang
Zheng Chu
Shijie Xuyang
Zhiyuan Ma
YuanTao Fan
Wanxiang Che
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals"
11 / 11 papers shown
Title
RobuNFR: Evaluating the Robustness of Large Language Models on Non-Functional Requirements Aware Code Generation
Feng Lin
Dong Jae Kim
Zhiyu Li
Jinqiu Yang
Tse-Husn
Chen
AAML
99
1
0
28 Mar 2025
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Tianyu Zheng
Ge Zhang
Tianhao Shen
Xueling Liu
Bill Yuchen Lin
Jie Fu
Wenhu Chen
Xiang Yue
SyDa
154
129
0
08 Jan 2025
A Survey on Natural Language Counterfactual Generation
Yongjie Wang
Xiaoqi Qiu
Yu Yue
Xu Guo
Zhiwei Zeng
Yuhong Feng
Zhiqi Shen
59
9
0
04 Jul 2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Terry Yue Zhuo
Minh Chien Vu
Jenny Chim
Han Hu
Wenhao Yu
...
David Lo
Daniel Fried
Xiaoning Du
H. D. Vries
Leandro von Werra
174
192
0
22 Jun 2024
The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?
Alex Gu
Wen-Ding Li
Naman Jain
Theo X. Olausson
Celine Lee
Koushik Sen
Armando Solar-Lezama
LRM
50
19
0
29 Feb 2024
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Xueyu Hu
Ziyu Zhao
Shuang Wei
Ziwei Chai
Qianli Ma
...
Jiwei Li
Kun Kuang
Yang Yang
Hongxia Yang
Leilei Gan
LMTD
ELM
65
57
0
10 Jan 2024
WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning
Zhaojian Yu
Xin Zhang
Ning Shang
Yangyu Huang
Can Xu
Yishujie Zhao
Wenxiang Hu
Qiufeng Yin
SyDa
92
28
0
20 Dec 2023
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Ziyang Luo
Can Xu
Pu Zhao
Qingfeng Sun
Xiubo Geng
Wenxiang Hu
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
ELM
SyDa
ALM
119
690
0
14 Jun 2023
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
54
3
0
12 Dec 2022
Counterfactual Explanations for Models of Code
Jürgen Cito
Işıl Dillig
V. Murali
S. Chandra
AAML
LRM
68
51
0
10 Nov 2021
Program Synthesis with Large Language Models
Jacob Austin
Augustus Odena
Maxwell Nye
Maarten Bosma
Henryk Michalewski
...
Ellen Jiang
Carrie J. Cai
Michael Terry
Quoc V. Le
Charles Sutton
ELM
AIMat
ReCod
ALM
216
2,004
0
16 Aug 2021
1