Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.00859
Cited By
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
2 September 2021
Yue Wang
Weishi Wang
Shafiq Joty
Guosheng Lin
Re-assign community
ArXiv (abs)
PDF
HTML
Github (2999★)
Papers citing
"CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation"
50 / 638 papers shown
Title
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
117
43
0
08 May 2024
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning
Karim Galliamov
Leila Khaertdinova
Karina Denisova
74
1
0
07 May 2024
Automatic Programming: Large Language Models and Beyond
Michael R. Lyu
Baishakhi Ray
Abhik Roychoudhury
Shin Hwei Tan
Patanamon Thongtanunam
94
21
0
03 May 2024
Calibration of Large Language Models on Code Summarization
Yuvraj Virk
Prem Devanbu
Toufique Ahmed
99
11
0
30 Apr 2024
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
92
3
0
26 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLL
KELM
LRM
154
88
0
25 Apr 2024
AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation
Zhensu Sun
Xiaoning Du
Zhou Yang
Li Li
David Lo
97
9
0
25 Apr 2024
VulEval: Towards Repository-Level Evaluation of Software Vulnerability Detection
Xinjie Wen
Xinchen Wang
Yujia Chen
Ruida Hu
David Lo
Cuiyun Gao
106
8
0
24 Apr 2024
Is Next Token Prediction Sufficient for GPT? Exploration on Code Logic Comprehension
Mengnan Qi
Yufan Huang
Yongqiang Yao
Maoquan Wang
Bin Gu
Neel Sundaresan
82
4
0
13 Apr 2024
Structure-aware Fine-tuning for Code Pre-trained Models
Jiayi Wu
Renyu Zhu
Nuo Chen
Qiushi Sun
Xiang Li
Ming Gao
88
2
0
11 Apr 2024
Analyzing the Performance of Large Language Models on Code Summarization
Rajarshi Haldar
Julia Hockenmaier
73
18
0
10 Apr 2024
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
Zhihao Lin
Wei Ma
Tao Lin
Yaowen Zheng
Jingquan Ge
Jun Wang
Jacques Klein
Tegawende F. Bissyande
Yang Liu
Li Li
VLM
69
5
0
09 Apr 2024
CSA-Trans: Code Structure Aware Transformer for AST
Saeyoon Oh
Shin Yoo
95
1
0
07 Apr 2024
AI for DevSecOps: A Landscape and Future Opportunities
Michael Fu
Jirat Pasuksmit
Chakkrit Tantithamthavorn
79
7
0
07 Apr 2024
The Case for Developing a Foundation Model for Planning-like Tasks from Scratch
Biplav Srivastava
Vishal Pallagani
LRM
64
2
0
06 Apr 2024
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Jiawei Guo
Ziming Li
Xueling Liu
Kaijing Ma
Tianyu Zheng
...
Xingwei Qu
Xiang Yue
Ge Zhang
Wenhu Chen
Jie Fu
KELM
155
14
0
04 Apr 2024
CSEPrompts: A Benchmark of Introductory Computer Science Prompts
Md. Nishat Raihan
Dhiman Goswami
Sadiya Sayara Chowdhury Puspo
Christian D. Newman
Tharindu Ranasinghe
Marcos Zampieri
ELM
55
2
0
03 Apr 2024
An Empirical Study of Automated Vulnerability Localization with Large Language Models
Jian Zhang
Chong Wang
Anran Li
Weisong Sun
Cen Zhang
Wei Ma
Yang Liu
95
20
0
30 Mar 2024
A Survey of using Large Language Models for Generating Infrastructure as Code
Kalahasti Ganesh Srivatsa
Sabyasachi Mukhopadhyay
Ganesh Katrapati
Manish Shrivastava
45
3
0
30 Mar 2024
Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLM
Chun Xia
Yinlin Deng
Lingming Zhang
ALM
ELM
80
39
0
28 Mar 2024
SCALE: Constructing Structured Natural Language Comment Trees for Software Vulnerability Detection
Xinjie Wen
Cuiyun Gao
Shuzheng Gao
Yang Xiao
Michael R. Lyu
106
9
0
28 Mar 2024
Vulnerability Detection with Code Language Models: How Far Are We?
Yangruibo Ding
Yanjun Fu
Omniyyah Ibrahim
Chawin Sitawarin
Xinyun Chen
Basel Alomair
David Wagner
Baishakhi Ray
Yizheng Chen
AAML
82
57
0
27 Mar 2024
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs
Guoqiang Chen
Xiuwei Shang
Shaoyin Cheng
Yanming Zhang
Weiming Zhang
Neng H. Yu
N. Yu
163
2
0
27 Mar 2024
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution
Wei Tao
Yucheng Zhou
Yanlin Wang
Wenqiang Zhang
Hongyu Zhang
Yu Cheng
LLMAG
107
47
0
26 Mar 2024
MESIA: Understanding and Leveraging Supplementary Nature of Method-level Comments for Automatic Comment Generation
Xinglu Pan
Chenxiao Liu
Yanzhen Zou
Tao Xie
Bing Xie
88
3
0
26 Mar 2024
Exploring the Impact of the Output Format on the Evaluation of Large Language Models for Code Translation
Marcos Macedo
Yuan Tian
F. Côgo
Bram Adams
75
17
0
25 Mar 2024
CYGENT: A cybersecurity conversational agent with log summarization powered by GPT-3
Prasasthy Balasubramanian
Justin Seby
Panos Kostakos
LLMAG
33
4
0
25 Mar 2024
Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback
Zhangqian Bi
Yao Wan
Zheng Wang
Hongyu Zhang
Batu Guan
Fangxin Lu
Zili Zhang
Yulei Sui
Hai Jin
Xuanhua Shi
54
15
0
25 Mar 2024
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
Zehan Li
Jianfei Zhang
Chuantao Yin
Y. Ouyang
Wenge Rong
59
4
0
25 Mar 2024
CodeS: Natural Language to Code Repository via Multi-Layer Sketch
Daoguang Zan
Ailun Yu
Wei Liu
Dong Chen
Bo Shen
...
Bei Guan
Zhiguang Yang
Yongji Wang
Qianxiang Wang
Li-zhen Cui
96
16
0
25 Mar 2024
ChatGPT Incorrectness Detection in Software Reviews
M. Tanzil
Junaed Younus Khan
Gias Uddin
84
4
0
25 Mar 2024
Semantically Aligned Question and Code Generation for Automated Insight Generation
Ananya Singha
Bhavya Chopra
Anirudh Khatry
Sumit Gulwani
Austin Z. Henley
Vu Le
Chris Parnin
Mukul Singh
Microsoft Belgium
121
3
0
21 Mar 2024
Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models
Chengzhe Feng
Yanan Sun
Ke Li
Pan Zhou
Jiancheng Lv
Aojun Lu
96
1
0
20 Mar 2024
On the effectiveness of Large Language Models for GitHub Workflows
Xinyu Zhang
Siddharth Muralee
Sourag Cherupattamoolayil
Aravind Machiry
111
3
0
19 Mar 2024
Teaching Machines to Code: Smart Contract Translation with LLMs
Rabimba Karanjai
Lei Xu
Weidong Shi
69
6
0
13 Mar 2024
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Ning Ding
Yulin Chen
Ganqu Cui
Xingtai Lv
Weilin Zhao
Ruobing Xie
Bowen Zhou
Zhiyuan Liu
Maosong Sun
ALM
MoMe
AI4CE
146
7
0
13 Mar 2024
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Naman Jain
King Han
Alex Gu
Wen-Ding Li
Fanjia Yan
Tianjun Zhang
Sida I. Wang
Armando Solar-Lezama
Koushik Sen
Ion Stoica
ELM
148
448
0
12 Mar 2024
RepoHyper: Better Context Retrieval Is All You Need for Repository-Level Code Completion
Huy N. Phan
Hoang N. Phan
Tien N. Nguyen
Nghi D. Q. Bui
85
12
0
10 Mar 2024
LEGION: Harnessing Pre-trained Language Models for GitHub Topic Recommendations with Distribution-Balance Loss
Yen-Trang Dang
Thanh Le-Cong
Phuc-Thanh Nguyen
Anh M. T. Bui
Phuong T. Nguyen
Bach Le
Quyet-Thang Huynh
67
0
0
09 Mar 2024
Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks
Linyuan Gong
Sida Wang
Mostafa Elhoushi
Alvin Cheung
112
17
0
07 Mar 2024
CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
Zhenru Lin
Yiqun Yao
Yang Yuan
ELM
33
0
0
04 Mar 2024
Semi-Instruct: Bridging Natural-Instruct and Self-Instruct for Code Large Language Models
Xianzhen Luo
Qingfu Zhu
Zhiming Zhang
Xu Wang
Qing Yang
Dongliang Xu
Wanxiang Che
ALM
71
2
0
01 Mar 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Tengjiao Wang
3DV
288
286
0
29 Feb 2024
Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency
Akila Wickramasekara
Frank Breitinger
Mark Scanlon
144
10
0
29 Feb 2024
Chain-of-Thought Prompting of Large Language Models for Discovering and Fixing Software Vulnerabilities
Yu Nong
Mohammed Aldeen
Long Cheng
Hongxin Hu
Feng Chen
Haipeng Cai
LRM
72
32
0
27 Feb 2024
Beyond Self-learned Attention: Mitigating Attention Bias in Transformer-based Models Using Attention Guidance
Jiri Gesi
Iftekhar Ahmed
74
0
0
26 Feb 2024
CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision
Hao Wang
Zeyu Gao
Chao Zhang
Zihan Sha
Mingyang Sun
Yuchen Zhou
Wenyu Zhu
Wenju Sun
Han Qiu
Xiangwei Xiao
85
22
0
26 Feb 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
Yuan Zhang
Xiao Wang
Zhiheng Xi
Han Xia
Tao Gui
Qi Zhang
Xuanjing Huang
88
4
0
26 Feb 2024
Language Models for Code Completion: A Practical Evaluation
Maliheh Izadi
Jonathan Katzy
Tim van Dam
Marc Otten
R. Popescu
Arie van Deursen
ALM
ELM
81
36
0
25 Feb 2024
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Demin Song
Honglin Guo
Yunhua Zhou
Shuhao Xing
Yudong Wang
...
Wenwei Zhang
Qipeng Guo
Hang Yan
Xipeng Qiu
Dahua Lin
SyDa
86
9
0
20 Feb 2024
Previous
1
2
3
4
5
6
...
11
12
13
Next