2109.00859
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
2 September 2021
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
Papers citing
"CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation"
50 / 610 papers shown
Title
MESIA: Understanding and Leveraging Supplementary Nature of Method-level Comments for Automatic Comment Generation
Xinglu Pan
Chenxiao Liu
Yanzhen Zou
Tao Xie
Bing Xie
27
2
0
26 Mar 2024
Exploring the Impact of the Output Format on the Evaluation of Large Language Models for Code Translation
Marcos Macedo
Yuan Tian
F. Côgo
Bram Adams
46
12
0
25 Mar 2024
CYGENT: A cybersecurity conversational agent with log summarization powered by GPT-3
Prasasthy Balasubramanian
Justin Seby
Panos Kostakos
LLMAG
19
3
0
25 Mar 2024
Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback
Zhangqian Bi
Yao Wan
Zheng Wang
Hongyu Zhang
Batu Guan
Fangxin Lu
Zili Zhang
Yulei Sui
Hai Jin
Xuanhua Shi
29
13
0
25 Mar 2024
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
Zehan Li
Jianfei Zhang
Chuantao Yin
Y. Ouyang
Wenge Rong
34
3
0
25 Mar 2024
CodeS: Natural Language to Code Repository via Multi-Layer Sketch
Daoguang Zan
Ailun Yu
Wei Liu
Dong Chen
Bo Shen
...
Bei Guan
Zhiguang Yang
Yongji Wang
Qianxiang Wang
Li-zhen Cui
33
14
0
25 Mar 2024
ChatGPT Incorrectness Detection in Software Reviews
M. Tanzil
Junaed Younus Khan
Gias Uddin
19
4
0
25 Mar 2024
Semantically Aligned Question and Code Generation for Automated Insight Generation
Ananya Singha
Bhavya Chopra
Anirudh Khatry
Sumit Gulwani
Austin Z. Henley
Vu Le
Chris Parnin
Mukul Singh
Microsoft Belgium
39
3
0
21 Mar 2024
Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models
Chengzhe Feng
Yanan Sun
Ke Li
Pan Zhou
Jiancheng Lv
Aojun Lu
51
1
0
20 Mar 2024
On the effectiveness of Large Language Models for GitHub Workflows
Xinyu Zhang
Siddharth Muralee
Sourag Cherupattamoolayil
Aravind Machiry
33
2
0
19 Mar 2024
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Ning Ding
Yulin Chen
Ganqu Cui
Xingtai Lv
Weilin Zhao
Ruobing Xie
Bowen Zhou
Zhiyuan Liu
Maosong Sun
ALM
MoMe
AI4CE
38
7
0
13 Mar 2024
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Naman Jain
King Han
Alex Gu
Wen-Ding Li
Fanjia Yan
Tianjun Zhang
Sida I. Wang
Armando Solar-Lezama
Koushik Sen
Ion Stoica
ELM
36
274
0
12 Mar 2024
RepoHyper: Better Context Retrieval Is All You Need for Repository-Level Code Completion
Huy N. Phan
Hoang N. Phan
Tien N. Nguyen
Nghi D. Q. Bui
38
3
0
10 Mar 2024
LEGION: Harnessing Pre-trained Language Models for GitHub Topic Recommendations with Distribution-Balance Loss
Yen-Trang Dang
Thanh Le-Cong
Phuc-Thanh Nguyen
Anh M. T. Bui
Phuong T. Nguyen
Bach Le
Quyet-Thang Huynh
32
0
0
09 Mar 2024
Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks
Linyuan Gong
Sida Wang
Mostafa Elhoushi
Alvin Cheung
27
15
0
07 Mar 2024
CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
Zhenru Lin
Yiqun Yao
Yang Yuan
ELM
23
0
0
04 Mar 2024
Semi-Instruct: Bridging Natural-Instruct and Self-Instruct for Code Large Language Models
Xianzhen Luo
Qingfu Zhu
Zhiming Zhang
Xu Wang
Qing Yang
Dongliang Xu
Wanxiang Che
ALM
32
2
0
01 Mar 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Bin Cui
3DV
115
228
0
29 Feb 2024
Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency
Akila Wickramasekara
F. Breitinger
Mark Scanlon
52
8
0
29 Feb 2024
Chain-of-Thought Prompting of Large Language Models for Discovering and Fixing Software Vulnerabilities
Yu Nong
Mohammed Aldeen
Long Cheng
Hongxin Hu
Feng Chen
Haipeng Cai
LRM
26
27
0
27 Feb 2024
Beyond Self-learned Attention: Mitigating Attention Bias in Transformer-based Models Using Attention Guidance
Jiri Gesi
Iftekhar Ahmed
54
0
0
26 Feb 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
Yuan Zhang
Xiao Wang
Zhiheng Xi
Han Xia
Tao Gui
Qi Zhang
Xuanjing Huang
36
3
0
26 Feb 2024
Language Models for Code Completion: A Practical Evaluation
M. Izadi
J. Katzy
Tim van Dam
Marc Otten
R. Popescu
A. van Deursen
ALM
ELM
39
22
0
25 Feb 2024
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Demin Song
Honglin Guo
Yunhua Zhou
Shuhao Xing
Yudong Wang
...
Wenwei Zhang
Qipeng Guo
Hang Yan
Xipeng Qiu
Dahua Lin
SyDa
65
8
0
20 Feb 2024
DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models
Berkay Berabi
Alexey Gronskiy
Veselin Raychev
Gishor Sivanrupan
Victor Chibotaru
Martin Vechev
KELM
25
8
0
19 Feb 2024
CovRL: Fuzzing JavaScript Engines with Coverage-Guided Reinforcement Learning for LLM-based Mutation
Jueon Eom
Seyeon Jeong
Taekyoung Kwon
29
7
0
19 Feb 2024
CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking
Zian Su
Xiangzhe Xu
Ziyang Huang
Zhuo Zhang
Yapeng Ye
Jianjun Huang
Xiangyu Zhang
OffRL
37
8
0
19 Feb 2024
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning
Yejie Wang
Keqing He
Guanting Dong
Pei Wang
Weihao Zeng
...
Yutao Mou
Mengdi Zhang
Jingang Wang
Xunliang Cai
Weiran Xu
ALM
26
9
0
14 Feb 2024
Resilient Watermarking for LLM-Generated Codes
Boquan Li
Mengdi Zhang
Peixin Zhang
Jun Sun
Xingmei Wang
Zijian Liu
Tianzi Zhang
WaLM
33
3
0
12 Feb 2024
On the Effectiveness of Machine Learning-based Call Graph Pruning: An Empirical Study
A. Mir
Mehdi Keshani
Sebastian Proksch
15
1
0
11 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
18
1
0
08 Feb 2024
Studying Vulnerable Code Entities in R
ZiXiao Zhao
Millon Madhur Das
Fatemeh H. Fard
AAML
54
0
0
06 Feb 2024
Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS
Matthew DeLorenzo
A. B. Chowdhury
Vasudev Gohil
Shailja Thakur
Ramesh Karri
Siddharth Garg
Jeyavijayan Rajendran
26
31
0
05 Feb 2024
UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing
Yifeng He
Jiabo Huang
Yuyang Rong
Yiwen Guo
Ethan Wang
Hao Chen
26
4
0
04 Feb 2024
CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Quang-Cuong Pham
Giang Do
Huy Nguyen
TrungTin Nguyen
Chenghao Liu
...
Binh T. Nguyen
Savitha Ramasamy
Xiaoli Li
Steven C. H. Hoi
Nhat Ho
25
17
0
04 Feb 2024
Code Representation Learning At Scale
Dejiao Zhang
W. Ahmad
Ming Tan
Hantian Ding
Ramesh Nallapati
Dan Roth
Xiaofei Ma
Bing Xiang
OffRL
21
9
0
02 Feb 2024
COMET: Generating Commit Messages using Delta Graph Context Representation
Abhinav Reddy Mandli
Saurabhsingh Rajput
Tushar Sharma
31
1
0
02 Feb 2024
Getting the most out of your tokenizer for pre-training and domain adaptation
Gautier Dagan
Gabriele Synnaeve
Baptiste Rozière
34
20
0
01 Feb 2024
SymbolicAI: A framework for logic-based approaches combining generative models and solvers
Marius-Constantin Dinu
Claudiu Leoveanu-Condrei
Markus Holzleitner
Werner Zellinger
Sepp Hochreiter
43
10
0
01 Feb 2024
Security and Privacy Challenges of Large Language Models: A Survey
B. Das
M. H. Amini
Yanzhao Wu
PILM
ELM
19
103
0
30 Jan 2024
PPM: Automated Generation of Diverse Programming Problems for Benchmarking Code Generation Models
Simin Chen
Xiaoning Feng
Xiao Han
Cong Liu
Wei Yang
42
3
0
28 Jan 2024
A Systematic Literature Review on Explainability for Machine/Deep Learning-based Software Engineering Research
Sicong Cao
Xiaobing Sun
Ratnadira Widyasari
David Lo
Xiaoxue Wu
...
Jiale Zhang
Bin Li
Wei Liu
Di Wu
Yixin Chen
31
6
0
26 Jan 2024
ZS4C: Zero-Shot Synthesis of Compilable Code for Incomplete Code Snippets using ChatGPT
Azmain Kabir
Shaowei Wang
Yuan Tian
Tse-Hsun Chen
Muhammad Asaduzzaman
Wenbin Zhang
12
0
0
25 Jan 2024
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Daya Guo
Qihao Zhu
Dejian Yang
Zhenda Xie
Kai Dong
...
Yu-Huan Wu
Y. K. Li
Fuli Luo
Yingfei Xiong
W. Liang
ELM
48
664
0
25 Jan 2024
Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion
Dylan Zhang
Curt Tigges
Zory Zhang
Stella Biderman
Maxim Raginsky
Talia Ringer
24
11
0
23 Jan 2024
Distilling Mathematical Reasoning Capabilities into Small Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
LRM
34
9
0
22 Jan 2024
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
Mayank Agarwal
Yikang Shen
Bailin Wang
Yoon Kim
Jie Chen
45
5
0
19 Jan 2024
When Neural Code Completion Models Size up the Situation: Attaining Cheaper and Faster Completion through Dynamic Model Inference
Zhensu Sun
Xiaoning Du
Fu Song
Shangwen Wang
Li Li
25
10
0
18 Jan 2024
KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation
Wei Tao
Yucheng Zhou
Yanlin Wang
Hongyu Zhang
Haofen Wang
Wenqiang Zhang
24
10
0
16 Jan 2024
A Novel Approach for Automatic Program Repair using Round-Trip Translation with Large Language Models
Fernando Vallecillos Ruiz
Anastasiia Grishina
Max Hort
Leon Moonen
LRM
42
4
0
15 Jan 2024