Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.08366
Cited By
GraphCodeBERT: Pre-training Code Representations with Data Flow
17 September 2020
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
Shujie Liu
Long Zhou
Nan Duan
Alexey Svyatkovskiy
Shengyu Fu
Michele Tufano
Shao Kun Deng
Colin B. Clement
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GraphCodeBERT: Pre-training Code Representations with Data Flow"
50 / 405 papers shown
Title
SimSCOOD: Systematic Analysis of Out-of-Distribution Generalization in Fine-tuned Source Code Models
Hossein Hajipour
Ning Yu
Cristian-Alexandru Staicu
Mario Fritz
OODD
27
4
0
10 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
21
7
0
07 Oct 2022
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
Nuo Chen
Qiushi Sun
Renyu Zhu
Xiang Li
Xuesong Lu
Ming Gao
44
10
0
07 Oct 2022
MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation
Zeming Dong
Qiang Hu
Yuejun Guo
Maxime Cordy
Mike Papadakis
Zhenya Zhang
Yves Le Traon
Jianjun Zhao
31
8
0
06 Oct 2022
ContraCLM: Contrastive Learning For Causal Language Model
Nihal Jain
Dejiao Zhang
Wasi Uddin Ahmad
Zijian Wang
Feng Nan
...
Ramesh Nallapati
Baishakhi Ray
Parminder Bhatia
Xiaofei Ma
Bing Xiang
31
16
0
03 Oct 2022
CodeQueries: A Dataset of Semantic Queries over Code
Surya Prakash Sahu
Madhurima Mandal
Shikhar Bharadwaj
Aditya Kanade
Petros Maniatis
S. Shevade
25
4
0
17 Sep 2022
Semantic-Preserving Adversarial Code Comprehension
Yiyang Li
Hongqiu Wu
Hai Zhao
AAML
18
7
0
12 Sep 2022
Generalizability of Code Clone Detection on CodeBERT
Tim Sonnekalb
Bernd Gruner
C. Brust
Patrick Mäder
20
14
0
26 Aug 2022
Topical: Learning Repository Embeddings from Source Code using Attention
Agathe Lherondelle
Varun Babbar
Yash Satsangi
Fran Silavong
Shaltiel Eloul
Sean J. Moran
27
0
0
19 Aug 2022
Learning Program Representations with a Tree-Structured Transformer
Wenhan Wang
Kechi Zhang
Ge Li
Shangqing Liu
Anran Li
Zhi Jin
Yang Liu
47
5
0
18 Aug 2022
CommitBART: A Large Pre-trained Model for GitHub Commits
Shangqing Liu
Yanzhou Li
Xiaofei Xie
Yang Liu
VLM
AI4TS
29
18
0
17 Aug 2022
A Library for Representing Python Programs as Graphs for Machine Learning
David Bieber
Kensen Shi
Petros Maniatis
Charles Sutton
Vincent J. Hellendoorn
Daniel D. Johnson
Daniel Tarlow
GNN
AI4CE
30
5
0
15 Aug 2022
Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Patrick Flynn
T. Vanderbruggen
C. Liao
Pei-Hung Lin
M. Emani
Xipeng Shen
24
4
0
11 Aug 2022
CoditT5: Pretraining for Source Code and Natural Language Editing
Jiyang Zhang
Sheena Panthaplackel
Pengyu Nie
Junyi Jessy Li
Miloš Gligorić
KELM
19
88
0
10 Aug 2022
Multi-View Pre-Trained Model for Code Vulnerability Identification
Xuxia Jiang
Yinhao Xiao
Jun Wang
Wei Zhang
40
1
0
10 Aug 2022
Learning to Learn to Predict Performance Regressions in Production at Meta
M. Beller
Hongyu Li
V. Nair
V. Murali
Imad Ahmad
Jürgen Cito
Drew Carlson
Gareth Ari Aye
Wes Dyer
33
5
0
08 Aug 2022
Code Comment Inconsistency Detection with BERT and Longformer
Theo Steiner
Rui Zhang
28
4
0
29 Jul 2022
No More Fine-Tuning? An Experimental Evaluation of Prompt Tuning in Code Intelligence
Chaozheng Wang
Yuanhang Yang
Cuiyun Gao
Yun Peng
Hongyu Zhang
Michael R. Lyu
AAML
67
134
0
24 Jul 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling
Fenia Christopoulou
Gerasimos Lampouras
Milan Gritta
Guchun Zhang
Yinpeng Guo
...
Guangtai Liang
Jia Wei
Xin Jiang
Qianxiang Wang
Qun Liu
ELM
SyDa
ALM
45
74
0
22 Jul 2022
What does Transformer learn about source code?
Kechi Zhang
Ge Li
Zhi Jin
ViT
28
8
0
18 Jul 2022
Few-shot training LLMs for project-specific code-summarization
Toufique Ahmed
Prem Devanbu
182
213
0
09 Jul 2022
Repository-Level Prompt Generation for Large Language Models of Code
Disha Shrivastava
Hugo Larochelle
Daniel Tarlow
28
137
0
26 Jun 2022
AST-Probe: Recovering abstract syntax trees from hidden representations of pre-trained language models
José Antonio Hernández López
Martin Weyssow
Jesús Sánchez Cuadrado
H. Sahraoui
27
22
0
23 Jun 2022
NatGen: Generative pre-training by "Naturalizing" source code
Saikat Chakraborty
Toufique Ahmed
Yangruibo Ding
Prem Devanbu
Baishakhi Ray
AI4CE
57
116
0
15 Jun 2022
CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation
Daoguang Zan
Bei Chen
Dejian Yang
Zeqi Lin
Minsu Kim
Bei Guan
Yongji Wang
Weizhu Chen
Jian-Guang Lou
25
120
0
14 Jun 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
72
528
0
13 Jun 2022
CodeS: Towards Code Model Generalization Under Distribution Shift
Qiang Hu
Yuejun Guo
Xiaofei Xie
Maxime Cordy
Lei Ma
Mike Papadakis
Yves Le Traon
OOD
33
10
0
11 Jun 2022
StructCoder: Structure-Aware Transformer for Code Generation
Sindhu Tipirneni
Ming Zhu
Chandan K. Reddy
30
55
0
10 Jun 2022
Fault-Aware Neural Code Rankers
J. Inala
Chenglong Wang
Mei Yang
Andrés Codas
Mark Encarnación
Shuvendu K. Lahiri
Madan Musuvathi
Jianfeng Gao
ALM
19
42
0
04 Jun 2022
Code Generation Tools (Almost) for Free? A Study of Few-Shot, Pre-Trained Language Models on Code
Patrick Bareiss
Beatriz Souza
Marcelo d’Amorim
Michael Pradel
ELM
16
76
0
02 Jun 2022
Learning code summarization from a small and local dataset
Toufique Ahmed
Prem Devanbu
51
9
0
02 Jun 2022
CodeAttack: Code-Based Adversarial Attacks for Pre-trained Programming Language Models
Akshita Jha
Chandan K. Reddy
SILM
ELM
AAML
30
59
0
31 May 2022
HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations
Minh Huynh Nguyen
Nghi D. Q. Bui
Truong-Son Hy
Long Tran-Thanh
Tien N. Nguyen
37
4
0
31 May 2022
Understanding Long Programming Languages with Structure-Aware Sparse Attention
Tingting Liu
Chengyu Wang
Cen Chen
Ming Gao
Aoying Zhou
32
3
0
27 May 2022
VulBERTa: Simplified Source Code Pre-Training for Vulnerability Detection
Hazim Hanif
S. Maffeis
66
95
0
25 May 2022
Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code
Changan Niu
Chuanyi Li
Bin Luo
Vincent Ng
SyDa
VLM
55
47
0
24 May 2022
Summarize and Generate to Back-translate: Unsupervised Translation of Programming Languages
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
47
27
0
23 May 2022
AdaptivePaste: Code Adaptation through Learning Semantics-aware Variable Usage Representations
Xiaoyu Liu
Jinu Jang
Neel Sundaresan
Miltiadis Allamanis
Alexey Svyatkovskiy
16
2
0
23 May 2022
NS3: Neuro-Symbolic Semantic Code Search
Shushan Arakelyan
Anna Hakhverdyan
Miltiadis Allamanis
Luis Garcia
Christophe Hauser
Xiang Ren
89
9
0
21 May 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang
Yasheng Wang
Yao Wan
Jiawei Wang
Pingyi Zhou
Li Li
Hao Wu
Jin Liu
26
33
0
04 May 2022
A Survey of Deep Learning Models for Structural Code Understanding
Ruoting Wu
Yuxin Zhang
Qibiao Peng
Liang Chen
Zibin Zheng
24
6
0
03 May 2022
On The Cross-Modal Transfer from Natural Language to Code through Adapter Modules
Divyam Goel
Raman Grover
Fatemeh H. Fard
20
18
0
19 Apr 2022
Addressing Leakage in Self-Supervised Contextualized Code Retrieval
Johannes Villmow
Viola Campos
A. Ulges
Ulrich Schwanecke
30
3
0
17 Apr 2022
Fix Bugs with Transformer through a Neural-Symbolic Edit Grammar
Yaojie Hu
Xingjian Shi
Qiang Zhou
Lee Pike
KELM
16
13
0
13 Apr 2022
Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment
Qiang Hu
Yuejun Guo
Maxime Cordy
Xiaofei Xie
Wei Ma
Mike Papadakis
Yves Le Traon
MQ
44
1
0
08 Apr 2022
CoCoSoDa: Effective Contrastive Learning for Code Search
Ensheng Shi
Yanlin Wang
Wenchao Gu
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
41
33
0
07 Apr 2022
Transformer-Based Language Models for Software Vulnerability Detection
Chandra Thapa
Seung Ick Jang
Muhammad Ejaz Ahmed
S. Çamtepe
J. Pieprzyk
Surya Nepal
34
96
0
07 Apr 2022
An Exploratory Study on Code Attention in BERT
Rishab Sharma
Fuxiang Chen
Fatemeh H. Fard
David Lo
27
25
0
05 Apr 2022
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages
Fuxiang Chen
F. Fard
David Lo
T. Bryksin
28
44
0
05 Apr 2022
Accelerating Code Search with Deep Hashing and Code Classification
Wenchao Gu
Yanlin Wang
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Michael R. Lyu
32
16
0
29 Mar 2022
Previous
1
2
3
4
5
6
7
8
9
Next