2002.08155
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
19 February 2020
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
Ming Gong
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
Papers citing
"CodeBERT: A Pre-Trained Model for Programming and Natural Languages"
50 / 314 papers shown
Title
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules
Hung Le
Hailin Chen
Amrita Saha
Akash Gokul
Doyen Sahoo
Chenyu You
LRM
28
42
0
13 Oct 2023
Towards Causal Deep Learning for Vulnerability Detection
Md. Mahbubur Rahman
Ira Ceka
Chengzhi Mao
Saikat Chakraborty
Baishakhi Ray
Wei Le
26
10
0
12 Oct 2023
Supersonic: Learning to Generate Source Code Optimizations in C/C++
Zimin Chen
Sen Fang
Martin Monperrus
41
11
0
26 Sep 2023
Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models
Levon Haroutunian
Zhuang Li
Lucian Galescu
Philip R. Cohen
Raj Tumuluri
Gholamreza Haffari
LRM
31
1
0
21 Sep 2023
A Full-fledged Commit Message Quality Checker Based on Machine Learning
David Faragó
Michael Färber
Christian Petrov
31
1
0
09 Sep 2023
Trustworthy and Synergistic Artificial Intelligence for Software Engineering: Vision and Roadmaps
David Lo
34
39
0
08 Sep 2023
Bias Testing and Mitigation in LLM-based Code Generation
Dong Huang
Qingwen Bu
Jie M. Zhang
Xiaofei Xie
Junjie Chen
Heming Cui
48
20
0
03 Sep 2023
A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER
Guanting Dong
Zechen Wang
Jinxu Zhao
Gang Zhao
Daichi Guo
...
Keqing He
Xuefeng Li
Liwen Wang
Xinyue Cui
Weiran Xu
37
19
0
28 Aug 2023
On the Impact of Language Selection for Training and Evaluating Programming Language Models
J. Katzy
M. Izadi
A. van Deursen
53
5
0
25 Aug 2023
kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Wenyu Zhu
Hao Wang
Yuchen Zhou
Jiaming Wang
Zihan Sha
Zeyu Gao
Chao Zhang
32
10
0
24 Aug 2023
Towards General Text Embeddings with Multi-stage Contrastive Learning
Zehan Li
Xin Zhang
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
71
351
0
07 Aug 2023
An Empirical Study of AI-based Smart Contract Creation
Rabimba Karanjai
Edward Li
Lei Xu
W. Shi
22
9
0
05 Aug 2023
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code
Nadezhda Chirkova
Sergey Troshin
21
8
0
01 Aug 2023
A Lightweight Framework for High-Quality Code Generation
Mohammed Latif Siddiq
B.K. Casey
Joanna C. S. Santos
44
17
0
17 Jul 2023
Exploring Continual Learning for Code Generation Models
Prateek Yadav
Q. Sun
Hantian Ding
Xiaopeng Li
Dejiao Zhang
...
Parminder Bhatia
Ramesh Nallapati
M. K. Ramanathan
Joey Tianyi Zhou
Bing Xiang
CLL
37
30
0
05 Jul 2023
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
42
78
0
04 Jul 2023
How Effective Are Neural Networks for Fixing Security Vulnerabilities
Yi Wu
Nan Jiang
H. Pham
Thibaud Lutellier
Jordan Davis
Lin Tan
Petr Babkin
Sameena Shah
AAML
21
79
0
29 May 2023
Coarse-Tuning Models of Code with Reinforcement Learning Feedback
Abhinav C. P. Jain
Chima Adiole
Swarat Chaudhuri
Thomas W. Reps
Chris Jermaine
ALM
25
2
0
25 May 2023
Understanding Programs by Exploiting (Fuzzing) Test Cases
Jianyu Zhao
Yuyang Rong
Yiwen Guo
Yifeng He
Hao Chen
35
16
0
23 May 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer
Piyush Jha
Joseph Scott
Jaya Sriram Ganeshna
M. Singh
Vijay Ganesh
24
5
0
21 May 2023
Towards Tracing Code Provenance with Code Watermarking
Wei Li
Borui Yang
Yujie Sun
Suyu Chen
Ziyun Song
Liyao Xiang
Xinbing Wang
Cheng Zhou
WaLM
32
6
0
21 May 2023
CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search
Nikita Sorokin
Dmitry Abulkhanov
Sergey I. Nikolenko
Valentin Malykh
29
3
0
19 May 2023
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets
I. Sedykh
Dmitry Abulkhanov
Nikita Sorokin
Sergey I. Nikolenko
Valentin Malykh
21
1
0
19 May 2023
Towards Code Generation from BDD Test Case Specifications: A Vision
Leon Chemnitz
David Reichenbach
Hani Aldebes
Mariam Naveed
Krishna Narasimhan
Mira Mezini
28
3
0
19 May 2023
ProgSG: Cross-Modality Representation Learning for Programs in Electronic Design Automation
Yunsheng Bai
Atefeh Sohrabizadeh
Zongyue Qin
Ziniu Hu
Yizhou Sun
Jason Cong
26
1
0
18 May 2023
Think Outside the Code: Brainstorming Boosts Large Language Models in Code Generation
Xinyu Li
Jiang-Tian Xue
Zheng Xie
Ming Li
LRM
19
26
0
18 May 2023
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Dũng Nguyễn Mạnh
Nam Le Hai
An Dau
A. Nguyen
Khanh N. Nghiem
Jingnan Guo
Nghi D. Q. Bui
34
15
0
09 May 2023
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
Anastasiia Grishina
Max Hort
Leon Moonen
22
6
0
08 May 2023
TASTY: A Transformer based Approach to Space and Time complexity
K. Moudgalya
Ankit Ramakrishnan
Vamsikrishna Chemudupati
Xinghai Lu
16
3
0
06 May 2023
Redundancy and Concept Analysis for Code-trained Language Models
Arushi Sharma
Zefu Hu
Christopher Quinn
Ali Jannesari
75
1
0
01 May 2023
Stochastic Code Generation
Swapnil Sharma
Nikita Anand
V. KranthiKiranG.
SyDa
30
0
0
14 Apr 2023
Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
Ensheng Shi
Yanlin Wang
Hongyu Zhang
Lun Du
Shi Han
Dongmei Zhang
Hongbin Sun
36
42
0
11 Apr 2023
"It's Weird That it Knows What I Want": Usability and Interactions with Copilot for Novice Programmers
James Prather
B. Reeves
Paul Denny
Brett A. Becker
Juho Leinonen
Andrew Luxton-Reilly
Garrett B. Powell
James Finnie-Ansley
E. Santos
40
131
0
05 Apr 2023
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X
Qinkai Zheng
Xiao Xia
Xu Zou
Yuxiao Dong
Shanshan Wang
...
Andi Wang
Yang Li
Teng Su
Zhilin Yang
Jie Tang
ELM
ALM
SyDa
69
317
0
30 Mar 2023
Neuro-Symbolic Execution of Generic Source Code
Yaojie Hu
Jin Tian
NAI
30
0
0
23 Mar 2023
JaCoText: A Pretrained Model for Java Code-Text Generation
Jessica Nayeli López Espejel
Mahaman Sanoussi Yahaya Alassan
Walid Dahhane
E. Ettifouri
37
3
0
22 Mar 2023
Implant Global and Local Hierarchy Information to Sequence based Code Representation Models
Kechi Zhang
Zhuo Li
Zhi Jin
Ge Li
29
7
0
14 Mar 2023
xASTNN: Improved Code Representations for Industrial Practice
Zhiwei Xu
Min Zhou
Xibin Zhao
Yang Chen
Xi Cheng
Hongyu Zhang
AI4TS
29
5
0
13 Mar 2023
Greener yet Powerful: Taming Large Code Generation Models with Quantization
Xiaokai Wei
Sujan Kumar Gonugondla
W. Ahmad
Shiqi Wang
Baishakhi Ray
...
Ben Athiwaratkun
Mingyue Shang
M. K. Ramanathan
Parminder Bhatia
Bing Xiang
MQ
30
6
0
09 Mar 2023
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
Aftab Hussain
Md Rafiqul Islam Rabin
Bowen Xu
David Lo
Mohammad Amin Alipour
42
3
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
38
508
0
07 Mar 2023
ADELT: Transpilation Between Deep Learning Frameworks
Linyuan Gong
Jiayi Wang
Alvin Cheung
32
3
0
07 Mar 2023
HugNLP: A Unified and Comprehensive Library for Natural Language Processing
Jingbo Wang
Nuo Chen
Qiushi Sun
Wenkang Huang
Chengyu Wang
Ming Gao
27
3
0
28 Feb 2023
Bayesian Networks for Named Entity Prediction in Programming Community Question Answering
Alexey Gorbatovski
Sergey Kovalchuk
19
2
0
26 Feb 2023
On ML-Based Program Translation: Perils and Promises
Aniketh Malyala
K. Zhou
Baishakhi Ray
Saikat Chakraborty
29
5
0
21 Feb 2023
COMET: Neural Cost Model Explanation Framework
Isha Chaudhary
Alex Renda
Charith Mendis
Gagandeep Singh
21
2
0
14 Feb 2023
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
Shuyan Zhou
Uri Alon
Sumit Agarwal
Graham Neubig
ELM
ALM
40
99
0
10 Feb 2023
Zero-Shot Learning for Requirements Classification: An Exploratory Study
Waad Alhoshan
Alessio Ferrari
Liping Zhao
VLM
9
39
0
09 Feb 2023
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models
Changan Niu
Chuanyi Li
Vincent Ng
Bin Luo
ELM
ALM
34
9
0
08 Feb 2023