Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.08366
Cited By
GraphCodeBERT: Pre-training Code Representations with Data Flow
17 September 2020
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
Shujie Liu
Long Zhou
Nan Duan
Alexey Svyatkovskiy
Shengyu Fu
Michele Tufano
Shao Kun Deng
Colin B. Clement
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GraphCodeBERT: Pre-training Code Representations with Data Flow"
50 / 405 papers shown
Title
StagedVulBERT: Multi-Granular Vulnerability Detection with a Novel Pre-trained Code Model
Yuan Jiang
Yujian Zhang
Xiaohong Su
Christoph Treude
Tiantian Wang
47
0
0
08 Oct 2024
Showing LLM-Generated Code Selectively Based on Confidence of LLMs
Jia Li
Yuqi Zhu
Yongmin Li
Ge Li
Zhi Jin
33
0
0
04 Oct 2024
Enhancing Pre-Trained Language Models for Vulnerability Detection via Semantic-Preserving Data Augmentation
Weiliang Qi
Jiahao Cao
Darsh Poddar
Sophia Li
Xinda Wang
24
0
0
30 Sep 2024
zsLLMCode: An Effective Approach for Code Embedding via LLM with Zero-Shot Learning
Zixiang Xian
Chenhui Cui
Rubing Huang
Chunrong Fang
Zhenyu Chen
31
0
0
23 Sep 2024
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data
Hossein Hajipour
Lea Schönherr
Thorsten Holz
Mario Fritz
AAML
SyDa
26
0
0
10 Sep 2024
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale
Huy N. Phan
Phong X. Nguyen
Nghi D. Q. Bui
LLMAG
33
11
0
09 Sep 2024
GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding
Ziyin Zhang
Hang Yu
Shijie Li
Peng Di
Jianguo Li
Rui Wang
27
2
0
06 Sep 2024
Unintentional Security Flaws in Code: Automated Defense via Root Cause Analysis
Nafis Tanveer Islam
Mazal Bethany
Dylan Manuel
Murtuza Jadliwala
Peyman Najafirad
33
0
0
30 Aug 2024
A Joint Learning Model with Variational Interaction for Multilingual Program Translation
Yali Du
Hui Sun
Ming Li
35
2
0
25 Aug 2024
Understanding Defects in Generated Codes by Language Models
Ali Mohammadi Esfahani
N. Kahani
S. Ajila
25
1
0
23 Aug 2024
Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking
Zhi-Cun Lyu
Xin-Ye Li
Zheng Xie
Ming Li
44
7
0
11 Aug 2024
ViC: Virtual Compiler Is All You Need For Assembly Code Search
Zeyu Gao
Hao Wang
Yuanda Wang
Chao Zhang
38
1
0
10 Aug 2024
Retrieval-augmented code completion for local projects using large language models
Marko Hostnik
Marko Robnik-Sikonja
RALM
32
0
0
09 Aug 2024
From Generalist to Specialist: Exploring CWE-Specific Vulnerability Detection
Syafiq Al Atiiq
Christian Gehrmann
Kevin Dahlén
Karim Khalil
34
1
0
05 Aug 2024
LLM Agents Improve Semantic Code Search
Sarthak Jain
Aditya Dora
Ka Seng Sam
Prabhat Singh
AIFin
26
5
0
05 Aug 2024
Vulnerability Detection in Ethereum Smart Contracts via Machine Learning: A Qualitative Analysis
Dalila Ressi
Alvise Spanò
Lorenzo Benetollo
Carla Piazza
M. Bugliesi
Sabina Rossi
42
1
0
26 Jul 2024
BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic Chunking and Hard Example Learning
Partha Chakraborty
Mahmoud Alfadel
Mei Nagappan
27
2
0
24 Jul 2024
Comparison of Static Application Security Testing Tools and Large Language Models for Repo-level Vulnerability Detection
Xin Zhou
Duc-Manh Tran
Thanh Le-Cong
Ting Zhang
Ivana Clairine Irsan
Joshua Sumarlin
Bach Le
David Lo
ELM
32
10
0
23 Jul 2024
Curriculum Learning for Small Code Language Models
Marwa Nair
K. Yamani
Lynda Said Lhadj
Riyadh Baghdadi
32
4
0
14 Jul 2024
Defending Code Language Models against Backdoor Attacks with Deceptive Cross-Entropy Loss
Guang Yang
Yu Zhou
Xiang Chen
Xiangyu Zhang
Terry Yue Zhuo
David Lo
Taolue Chen
AAML
57
4
0
12 Jul 2024
DeepCodeProbe: Towards Understanding What Models Trained on Code Learn
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
40
1
0
11 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
79
9
0
09 Jul 2024
Looking into Black Box Code Language Models
Muhammad Umair Haider
Umar Farooq
A. B. Siddique
Mark Marron
39
2
0
05 Jul 2024
ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization
Chunrong Fang
Dongrui Liu
Yuchen Chen
Xiao Chen
Zhao Wei
Quanjun Zhang
Yudu You
Bin Luo
Yang Liu
Zhenyu Chen
AI4TS
45
12
0
01 Jul 2024
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation
Jianheng Tang
Qifan Zhang
Yuhan Li
Nuo Chen
Jia Li
21
1
0
29 Jun 2024
NARRepair: Non-Autoregressive Code Generation Model for Automatic Program Repair
Zhenyu Yang
Zhen Yang
Zhongxing Yu
37
1
0
24 Jun 2024
SimClone: Detecting Tabular Data Clones using Value Similarity
Xu Yang
Gopi Krishnan Rajbahadur
Dayi Lin
Shaowei Wang
Zhen Ming
Jiang
27
1
0
24 Jun 2024
Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models
Jiayi Lin
Yutao Xie
Yue Yu
Yibiao Yang
Lei Zhang
SyDa
21
0
0
18 Jun 2024
A Critical Study of What Code-LLMs (Do Not) Learn
Abhinav Anand
Shweta Verma
Krishna Narasimhan
Mira Mezini
40
4
0
17 Jun 2024
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology
Minh Huynh Nguyen
Thang Phan Chau
Phong X. Nguyen
Nghi D. Q. Bui
37
11
0
16 Jun 2024
Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis
Zongyue Qin
Yunsheng Bai
Atefeh Sohrabizadeh
Zijian Ding
Ziniu Hu
Yizhou Sun
Jason Cong
28
2
0
13 Jun 2024
Estimating Difficulty Levels of Programming Problems with Pre-trained Model
Zhiyuan Wang
Wei Zhang
Jun Wang
26
0
0
13 Jun 2024
Scaling Automatic Extraction of Pseudocode
Levent Toksoz
Gang Tan
C. L. Giles
35
0
0
07 Jun 2024
Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning
Zheng Huang
Qihui Yang
Dawei Zhou
Yujun Yan
AI4CE
36
3
0
07 Jun 2024
Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning
Xiaohu Du
Ming Wen
Jiahao Zhu
Zifan Xie
Bin Ji
Huijun Liu
Xuanhua Shi
Hai Jin
37
14
0
06 Jun 2024
Enhancing Repository-Level Code Generation with Integrated Contextual Information
Zhiyuan Pan
Xing Hu
Xin Xia
Xiaohu Yang
34
3
0
05 Jun 2024
Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
Jungmin Yun
Mihyeon Kim
Youngbin Kim
77
9
0
03 Jun 2024
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
53
166
0
01 Jun 2024
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting
Tong Ye
Yangkai Du
Tengfei Ma
Lingfei Wu
Xuhong Zhang
Shouling Ji
Wenhai Wang
DeLMO
49
6
0
25 May 2024
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaixin Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
37
23
0
08 May 2024
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning
Karim Galliamov
Leila Khaertdinova
Karina Denisova
38
1
0
07 May 2024
Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures
Jorge Martínez Gil
34
4
0
03 May 2024
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
46
2
0
26 Apr 2024
Graph Neural Networks for Vulnerability Detection: A Counterfactual Explanation
Zhaoyang Chu
Yao Wan
Qian Li
Yang Wu
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
AAML
46
9
0
24 Apr 2024
VulEval: Towards Repository-Level Evaluation of Software Vulnerability Detection
Xinjie Wen
Xinchen Wang
Yujia Chen
Ruida Hu
David Lo
Cuiyun Gao
35
6
0
24 Apr 2024
On Unified Prompt Tuning for Request Quality Assurance in Public Code Review
Xinyu Chen
Lin Li
Rui Zhang
Peng Liang
32
1
0
11 Apr 2024
Structure-aware Fine-tuning for Code Pre-trained Models
Jiayi Wu
Renyu Zhu
Nuo Chen
Qiushi Sun
Xiang Li
Ming Gao
43
2
0
11 Apr 2024
Analyzing the Performance of Large Language Models on Code Summarization
Rajarshi Haldar
J. Hockenmaier
43
17
0
10 Apr 2024
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
Zhihao Lin
Wei Ma
Tao Lin
Yaowen Zheng
Jingquan Ge
Jun Wang
Jacques Klein
Tegawende F. Bissyande
Yang Liu
Li Li
VLM
35
4
0
09 Apr 2024
Multi-modal Learning for WebAssembly Reverse Engineering
Hanxian Huang
Jishen Zhao
31
2
0
04 Apr 2024
Previous
1
2
3
4
5
6
7
8
9
Next