Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.12655
Cited By
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
25 May 2021
Ruchi Puri
David S. Kung
G. Janssen
Wei Zhang
Giacomo Domeniconi
Vladmir A. Zolotov
Julian T Dolby
Jie Chen
M. Choudhury
Lindsey Decker
Veronika Thost
Luca Buratti
Saurabh Pujar
Shyam Ramji
Ulrich Finkler
Susan Malaika
Frederick Reiss
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks"
43 / 43 papers shown
Title
Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs
Mirazul Haque
Petr Babkin
Farima Farmahinifarahani
Manuela Veloso
32
0
0
07 May 2025
LSR-MCTS: Alleviating Long Range Dependency in Code Generation
Tingwei Lu
Yangning Li
Liyuan Wang
Binghuai Lin
Jiwei Tang
...
Hai-tao Zheng
Yinghui Li
Bingxu An
Zhao Wei
Yanwei Xu
LLMAG
62
0
0
10 Apr 2025
LLM-Driven Multi-step Translation from C to Rust using Static Analysis
Tianyang Zhou
Haowen Lin
Somesh Jha
Mihai Christodorescu
Kirill Levchenko
Varun Chandrasekaran
44
0
0
16 Mar 2025
ThrowBench: Benchmarking LLMs by Predicting Runtime Exceptions
Julian Aron Prenner
Romain Robbes
59
0
0
06 Mar 2025
LLM Program Optimization via Retrieval Augmented Search
Sagnik Anupam
Alexander Shypula
Osbert Bastani
126
1
0
31 Jan 2025
Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Xing Zhang
Jiaheng Wen
Fangkai Yang
Pu Zhao
Yu Kang
...
Qingwei Lin
Yingnong Dang
Saravan Rajmohan
Dongmei Zhang
Qi Zhang
56
2
0
28 Jan 2025
AIGCodeSet: A New Annotated Dataset for AI Generated Code Detection
Basak Demirok
Mucahid Kutlu
DeLMO
92
0
0
21 Dec 2024
Automated Proof Generation for Rust Code via Self-Evolution
Tianyu Chen
Shuai Lu
Shan Lu
Y. Gong
Chenyuan Yang
...
Peng Cheng
Fan Yang
Shuvendu Lahiri
Tao Xie
Lidong Zhou
39
7
0
21 Oct 2024
CursorCore: Assist Programming through Aligning Anything
Hao Jiang
Qi Liu
Rui Li
Shengyu Ye
Shijin Wang
53
1
0
09 Oct 2024
The Struggles of LLMs in Cross-lingual Code Clone Detection
Micheline Bénédicte Moumoula
A. Kaboré
Jacques Klein
Tegawende F. Bissyande
120
1
0
08 Aug 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
73
9
0
09 Jul 2024
Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization
Yuchi Liu
Jaskirat Singh
Gaowen Liu
Ali Payani
Liang Zheng
LLMAG
76
4
0
30 May 2024
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Mayank Mishra
Matt Stallone
Gaoyuan Zhang
Yikang Shen
Aditya Prasad
...
Amith Singhee
Nirmit Desai
David D. Cox
Ruchir Puri
Rameswar Panda
AI4TS
56
55
0
07 May 2024
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Jiawei Guo
Ziming Li
Xueling Liu
Kaijing Ma
Tianyu Zheng
...
Xingwei Qu
Xiang Yue
Ge Zhang
Wenhu Chen
Jie Fu
KELM
59
12
0
04 Apr 2024
Semi-Instruct: Bridging Natural-Instruct and Self-Instruct for Code Large Language Models
Xianzhen Luo
Qingfu Zhu
Zhiming Zhang
Xu Wang
Qing Yang
Dongliang Xu
Wanxiang Che
ALM
32
2
0
01 Mar 2024
UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing
Yifeng He
Jiabo Huang
Yuyang Rong
Yiwen Guo
Ethan Wang
Hao Chen
26
4
0
04 Feb 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
75
27
0
25 Jan 2024
Deduplicating and Ranking Solution Programs for Suggesting Reference Solutions
Atsushi Shirafuji
Yutaka Watanobe
24
1
0
16 Jul 2023
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
42
78
0
04 Jul 2023
Coarse-Tuning Models of Code with Reinforcement Learning Feedback
Abhinav C. P. Jain
Chima Adiole
Swarat Chaudhuri
Thomas W. Reps
Chris Jermaine Rice University
ALM
19
2
0
25 May 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets
I. Sedykh
Dmitry Abulkhanov
Nikita Sorokin
Sergey I. Nikolenko
Valentin Malykh
21
1
0
19 May 2023
Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification
Guang Yang
Tiancheng Jin
Liang Dou
13
2
0
07 May 2023
RunBugRun -- An Executable Dataset for Automated Program Repair
Julian Aron Prenner
Romain Robbes
35
11
0
03 Apr 2023
Implant Global and Local Hierarchy Information to Sequence based Code Representation Models
Kechi Zhang
Zhuo Li
Zhi Jin
Ge Li
23
7
0
14 Mar 2023
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models
Changan Niu
Chuanyi Li
Vincent Ng
Bin Luo
ELM
ALM
34
9
0
08 Feb 2023
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
23
1
0
12 Dec 2022
Piloting Copilot and Codex: Hot Temperature, Cold Prompts, or Black Magic?
Jean-Baptiste Döderlein
M. Acher
D. Khelladi
B. Combemale
34
33
0
26 Oct 2022
MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation
Zeming Dong
Qiang Hu
Yuejun Guo
Maxime Cordy
Mike Papadakis
Zhenya Zhang
Yves Le Traon
Jianjun Zhao
28
8
0
06 Oct 2022
CodeS: Towards Code Model Generalization Under Distribution Shift
Qiang Hu
Yuejun Guo
Xiaofei Xie
Maxime Cordy
Lei Ma
Mike Papadakis
Yves Le Traon
OOD
28
10
0
11 Jun 2022
Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment
Qiang Hu
Yuejun Guo
Maxime Cordy
Xiaofei Xie
Wei Ma
Mike Papadakis
Yves Le Traon
MQ
36
1
0
08 Apr 2022
LaF: Labeling-Free Model Selection for Automated Deep Neural Network Reusing
Qiang Hu
Yuejun Guo
Maxime Cordy
Xiaofei Xie
Mike Papadakis
Yves Le Traon
23
5
0
08 Apr 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions
David Bieber
Rishab Goel
Daniel Zheng
Hugo Larochelle
Daniel Tarlow
16
15
0
07 Mar 2022
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems Perspective
Erfan Al-Hossami
Samira Shaikh
26
6
0
10 Feb 2022
Competition-Level Code Generation with AlphaCode
Yujia Li
David Choi
Junyoung Chung
Nate Kushman
Julian Schrittwieser
...
Esme Sutherland Robson
Pushmeet Kohli
Nando de
Koray Kavukcuoglu
Oriol Vinyals
26
1,295
0
08 Feb 2022
Federated Data Science to Break Down Silos [Vision]
Essam Mansour
Kavitha Srinivas
K. Hose
FedML
AI4CE
24
8
0
25 Nov 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
33
0
0
22 Nov 2021
Deep Distilling: automated code generation using explainable deep learning
Paul J. Blazek
Kesavan Venkatesh
Milo M. Lin
16
2
0
16 Nov 2021
AVATAR: A Parallel Corpus for Java-Python Program Translation
W. Ahmad
Md Golam Rahman Tushar
Saikat Chakraborty
Kai-Wei Chang
35
78
0
26 Aug 2021
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
208
624
0
20 May 2021
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Shuai Lu
Daya Guo
Shuo Ren
Junjie Huang
Alexey Svyatkovskiy
...
Nan Duan
Neel Sundaresan
Shao Kun Deng
Shengyu Fu
Shujie Liu
ELM
198
853
0
09 Feb 2021
Learning to Represent Programs with Heterogeneous Graphs
Kechi Zhang
Wenhan Wang
Huangzhao Zhang
Ge Li
Zhi Jin
GNN
21
63
0
08 Dec 2020
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
1