Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.00859
Cited By
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
2 September 2021
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation"
50 / 610 papers shown
Title
UniCoder: Scaling Code Large Language Model via Universal Code
Tao Sun
Linzheng Chai
Jian Yang
Yuwei Yin
Hongcheng Guo
Jiaheng Liu
Bing Wang
Liqun Yang
Zhoujun Li
OffRL
LRM
63
16
0
24 Jun 2024
Can We Trust Large Language Models Generated Code? A Framework for In-Context Learning, Security Patterns, and Code Evaluations Across Diverse LLMs
Ahmad Mohsin
Helge Janicke
Adrian Wood
Iqbal H. Sarker
Leandros A. Maglaras
N. Janjua
33
8
0
18 Jun 2024
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Joao Monteiro
Pierre-Andre Noel
Étienne Marcotte
Sai Rajeswar
Valentina Zantedeschi
David Vazquez
Nicolas Chapados
Christopher Pal
Perouz Taslakian
39
4
0
17 Jun 2024
A Critical Study of What Code-LLMs (Do Not) Learn
Abhinav Anand
Shweta Verma
Krishna Narasimhan
Mira Mezini
40
4
0
17 Jun 2024
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology
Minh Huynh Nguyen
Thang Phan Chau
Phong X. Nguyen
Nghi D. Q. Bui
26
11
0
16 Jun 2024
Out of style: Misadventures with LLMs and code style transfer
Karl Munson
Chih-Kai Ting
Serenity Wade
Anish Savla
Julian T Dolby
Kiran Kate
Kavitha Srinivas
18
0
0
14 Jun 2024
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models
Jie Chen
Xintian Han
Yu Ma
Xun Zhou
Liang Xiang
ALM
LRM
34
2
0
14 Jun 2024
Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis
Zongyue Qin
Yunsheng Bai
Atefeh Sohrabizadeh
Zijian Ding
Ziniu Hu
Yizhou Sun
Jason Cong
28
1
0
13 Jun 2024
Leveraging Large Language Models for Efficient Failure Analysis in Game Development
Leonardo Marini
Linus Gisslén
Alessandro Sestini
43
0
0
11 Jun 2024
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection
Shenao Yan
Shen Wang
Yue Duan
Hanbin Hong
Kiho Lee
Doowon Kim
Yuan Hong
AAML
SILM
38
16
0
10 Jun 2024
Security Vulnerability Detection with Multitask Self-Instructed Fine-Tuning of Large Language Models
Aidan Z. H. Yang
Haoye Tian
He Ye
Ruben Martins
Claire Le Goues
34
5
0
09 Jun 2024
Enhancing Repository-Level Code Generation with Integrated Contextual Information
Zhiyuan Pan
Xing Hu
Xin Xia
Xiaohu Yang
26
3
0
05 Jun 2024
R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models
Ken Deng
Jiaheng Liu
He Zhu
Congnan Liu
Jingxin Li
...
Yuanxing Zhang
Wenbo Su
Bangyu Xiang
Tiezheng Ge
Bo Zheng
42
2
0
03 Jun 2024
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
42
159
0
01 Jun 2024
Confidence-Aware Sub-Structure Beam Search (CABS): Mitigating Hallucination in Structured Data Generation with Large Language Models
Chengwei Wei
Kee Kiat Koo
Amir Tavanaei
Karim Bouyarmane
27
1
0
30 May 2024
Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Jingchang Chen
Hongxuan Tang
Zheng Chu
Qianglong Chen
Zekun Wang
Ming Liu
Bing Qin
47
4
0
30 May 2024
GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation
Ehud Malul
Yair Meidan
D. Mimran
Yuval Elovici
A. Shabtai
30
4
0
30 May 2024
Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion
Wei Cheng
Yuhan Wu
Wei Hu
30
11
0
30 May 2024
Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases
Zian Su
Xiangzhe Xu
Ziyang Huang
Kaiyuan Zhang
Xiangyu Zhang
32
5
0
30 May 2024
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
Houxing Ren
Mingjie Zhan
Zhongyuan Wu
Aojun Zhou
Junting Pan
Hongsheng Li
SyDa
34
7
0
27 May 2024
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting
Tong Ye
Yangkai Du
Tengfei Ma
Lingfei Wu
Xuhong Zhang
Shouling Ji
Wenhai Wang
DeLMO
35
6
0
25 May 2024
Large Language Models Meet NLP: A Survey
Libo Qin
Qiguang Chen
Xiachong Feng
Yang Wu
Yongheng Zhang
Yinghui Li
Min Li
Wanxiang Che
Philip S. Yu
ALM
LM&MA
ELM
LRM
40
47
0
21 May 2024
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Jianbo Dai
Jianqiao Lu
Yunlong Feng
Rongju Ruan
Ming Cheng
Haochen Tan
Zhijiang Guo
ELM
LRM
36
12
0
19 May 2024
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving
Md. Ashraful Islam
Mohammed Eunus Ali
Md. Rizwan Parvez
SyDa
26
48
0
18 May 2024
IntelliExplain: Enhancing Interactive Code Generation through Natural Language Explanations for Non-Professional Programmers
Hao Yan
Thomas D. Latoza
Ziyu Yao
LRM
43
0
0
16 May 2024
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining
Dawei Feng
Yihai Zhang
Zhixuan Xu
SyDa
30
0
0
16 May 2024
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
K. Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Janet Liu
H. Wang
31
23
0
08 May 2024
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning
Karim Galliamov
Leila Khaertdinova
Karina Denisova
38
1
0
07 May 2024
Automatic Programming: Large Language Models and Beyond
Michael R. Lyu
Baishakhi Ray
Abhik Roychoudhury
Shin Hwei Tan
Patanamon Thongtanunam
33
15
0
03 May 2024
Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores
Yuvraj Virk
Prem Devanbu
Toufique Ahmed
54
10
0
30 Apr 2024
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
38
2
0
26 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLL
KELM
LRM
39
63
0
25 Apr 2024
AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation
Zhensu Sun
Xiaoning Du
Zhou Yang
Li Li
David Lo
28
10
0
25 Apr 2024
VulEval: Towards Repository-Level Evaluation of Software Vulnerability Detection
Xinjie Wen
Xinchen Wang
Yujia Chen
Ruida Hu
David Lo
Cuiyun Gao
27
6
0
24 Apr 2024
Is Next Token Prediction Sufficient for GPT? Exploration on Code Logic Comprehension
Mengnan Qi
Yufan Huang
Yongqiang Yao
Maoquan Wang
Bin Gu
Neel Sundaresan
41
2
0
13 Apr 2024
Structure-aware Fine-tuning for Code Pre-trained Models
Jiayi Wu
Renyu Zhu
Nuo Chen
Qiushi Sun
Xiang Li
Ming Gao
35
2
0
11 Apr 2024
Analyzing the Performance of Large Language Models on Code Summarization
Rajarshi Haldar
J. Hockenmaier
38
16
0
10 Apr 2024
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
Zhihao Lin
Wei Ma
Tao Lin
Yaowen Zheng
Jingquan Ge
Jun Wang
Jacques Klein
Tegawende F. Bissyande
Yang Liu
Li Li
VLM
30
4
0
09 Apr 2024
CSA-Trans: Code Structure Aware Transformer for AST
Saeyoon Oh
Shin Yoo
36
1
0
07 Apr 2024
AI for DevSecOps: A Landscape and Future Opportunities
Michael Fu
Jirat Pasuksmit
C. Tantithamthavorn
33
6
0
07 Apr 2024
The Case for Developing a Foundation Model for Planning-like Tasks from Scratch
Biplav Srivastava
Vishal Pallagani
LRM
37
2
0
06 Apr 2024
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Jiawei Guo
Ziming Li
Xueling Liu
Kaijing Ma
Tianyu Zheng
...
Xingwei Qu
Xiang Yue
Ge Zhang
Wenhu Chen
Jie Fu
KELM
57
12
0
04 Apr 2024
CSEPrompts: A Benchmark of Introductory Computer Science Prompts
Md. Nishat Raihan
Dhiman Goswami
Sadiya Sayara Chowdhury Puspo
Christian D. Newman
Tharindu Ranasinghe
Marcos Zampieri
ELM
36
2
0
03 Apr 2024
An Empirical Study of Automated Vulnerability Localization with Large Language Models
Jian Zhang
Chong Wang
Anran Li
Weisong Sun
Cen Zhang
Wei Ma
Yang Liu
39
17
0
30 Mar 2024
A Survey of using Large Language Models for Generating Infrastructure as Code
Kalahasti Ganesh Srivatsa
Sabyasachi Mukhopadhyay
Ganesh Katrapati
Manish Shrivastava
33
1
0
30 Mar 2024
Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLM
Chun Xia
Yinlin Deng
Lingming Zhang
ALM
ELM
30
26
0
28 Mar 2024
SCALE: Constructing Structured Natural Language Comment Trees for Software Vulnerability Detection
Xinjie Wen
Cuiyun Gao
Shuzheng Gao
Yang Xiao
Michael R. Lyu
22
5
0
28 Mar 2024
Vulnerability Detection with Code Language Models: How Far Are We?
Yangruibo Ding
Yanjun Fu
Omniyyah Ibrahim
Chawin Sitawarin
Xinyun Chen
Basel Alomair
David A. Wagner
Baishakhi Ray
Yizheng Chen
AAML
41
45
0
27 Mar 2024
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs
Guoqiang Chen
Xiuwei Shang
Shaoyin Cheng
Yanming Zhang
Weiming Zhang
Neng H. Yu
N. Yu
92
2
0
27 Mar 2024
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution
Wei Tao
Yucheng Zhou
Yanlin Wang
Wenqiang Zhang
Hongyu Zhang
Yu-Xi Cheng
LLMAG
52
36
0
26 Mar 2024
Previous
1
2
3
4
5
...
11
12
13
Next