ResearchTrend.AI
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation

2 September 2021
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi

Papers citing "CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation"

50 / 610 papers shown
Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data
David de-Fitero-Dominguez
Antonio Garcia-Cabot
Eva García-López
SyDa
61
0
0
12 May 2025
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
Shouyang Dong
Yuanbo Wen
Jun Bi
Di Huang
Jiaming Guo
...
Yifan Hao
Xuehai Zhou
Tianshi Chen
Qi Guo
Yunji Chen
22
0
0
04 May 2025
BiGSCoder: State Space Model for Code Understanding
Shweta Verma
Abhinav Anand
Mira Mezini
Mamba
46
0
0
02 May 2025
An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding
Xiuwei Shang
Zhenkan Fu
Shaoyin Cheng
Guoqiang Chen
Gangyang Li
Li Hu
W. Zhang
N. Yu
62
0
0
30 Apr 2025
Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks
Kang Yang
Xinjun Mao
Shangwen Wang
Y. Wang
Tanghaoran Zhang
Bo Lin
Yihao Qin
Zhang Zhang
Yao Lu
Kamal Al-Sabahi
ALM
135
1
0
28 Apr 2025
ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving
Jie JW Wu
Manav Chaudhary
Davit O. Abrahamyan
Arhaan Khaku
Anjiang Wei
Fatemeh H. Fard
SyDa
44
0
0
23 Apr 2025
Automated Static Vulnerability Detection via a Holistic Neuro-symbolic Approach
Penghui Li
Songchen Yao
Josef Sarfati Korich
Changhua Luo
Jianjia Yu
Yinzhi Cao
Junfeng Yang
126
0
0
22 Apr 2025
Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs
Marina Sakharova
Abhinav Anand
Mira Mezini
56
0
0
21 Apr 2025
Iterative Self-Training for Code Generation via Reinforced Re-Ranking
Nikita Sorokin
I. Sedykh
Valentin Malykh
31
0
0
13 Apr 2025
DocAgent: A Multi-Agent System for Automated Code Documentation Generation
Dayu Yang
Antoine Simoulin
Xin Qian
Xiaoyi Liu
Yuwei Cao
Zhaopu Teng
Grey Yang
LLMAG
54
0
0
11 Apr 2025
ML For Hardware Design Interpretability: Challenges and Opportunities
Raymond Baartmans
Andrew Ensinger
Victor Agostinelli
Lizhong Chen
29
0
0
11 Apr 2025
Zero-Shot Cross-Domain Code Search without Fine-Tuning
Keyu Liang
Z. Liu
Chao Liu
Zhiyuan Wan
David Lo
Xiaohu Yang
26
0
0
10 Apr 2025
From Token to Line: Enhancing Code Generation with a Long-Term Perspective
Tingwei Lu
Yangning Li
Liyuan Wang
Binghuai Lin
Jiwei Tang
...
Hai-tao Zheng
Yinghui Li
Bingxu An
Zhao Wei
Y. Xu
LLMAG
57
0
0
10 Apr 2025
DeCoMa: Detecting and Purifying Code Dataset Watermarks through Dual Channel Code Abstraction
Yuan Xiao
Yuchen Chen
Shiqing Ma
Haocheng Huang
Chunrong Fang
Y. Chen
Weisong Sun
Yunfeng Zhu
X. Zhang
Zhenyu Chen
31
0
0
09 Apr 2025
RETROcode: Leveraging a Code Database for Improved Natural Language to Code Generation
Nathanael Beau
Benoît Crabbé
23
0
0
08 Apr 2025
Generative Large Language Model usage in Smart Contract Vulnerability Detection
Peter Ince
Jiangshan Yu
Joseph K. Liu
Xiaoning Du
32
0
0
07 Apr 2025
OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs
Wasi Uddin Ahmad
Aleksander Ficek
Mehrzad Samadi
Jocelyn Huang
Vahid Noroozi
Somshubra Majumdar
Boris Ginsburg
ALM
37
0
0
05 Apr 2025
On Benchmarking Code LLMs for Android Malware Analysis
Yiling He
Hongyu She
Xingzhi Qian
Xinran Zheng
Zhuo Chen
Z. Qin
Lorenzo Cavallaro
ELM
43
1
0
01 Apr 2025
Carbon Footprint Evaluation of Code Generation through LLM as a Service
Tina Vartziotis
Maximilian Schmidt
George Dasoulas
Ippolyti Dellatolas
Stefano Attademo
Viet Dung Le
Anke Wiechmann
Tim Hoffmann
Michael Keckeisen
S. Kotsopoulos
33
2
0
30 Mar 2025
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
Indraneil Paul
Haoyi Yang
Goran Glavas
Kristian Kersting
Iryna Gurevych
AAML
SyDa
36
0
0
27 Mar 2025
ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation
Jiahui Xiang
Tong Ye
Peiyu Liu
Yinan Zhang
Wenhai Wang
43
0
0
24 Mar 2025
Large Language Models (LLMs) for Source Code Analysis: applications, models and datasets
Hamed Jelodar
Mohammad Meymani
Roozbeh Razavi-Far
40
0
0
21 Mar 2025
On Explaining (Large) Language Models For Code Using Global Code-Based Explanations
David Nader-Palacio
Dipin Khati
Daniel Rodríguez-Cárdenas
Alejandro Velasco
Denys Poshyvanyk
LRM
42
0
0
21 Mar 2025
Enhancing Code LLM Training with Programmer Attention
Y. Zhang
Chen Huang
Z. Karas
Dung T. Nguyen
Kevin Leach
Yu Huang
72
0
0
19 Mar 2025
LLM-Aided Customizable Profiling of Code Data Based On Programming Language Concepts
Pankaj Thorat
Adnan Qidwai
Adrija Dhar
Aishwariya Chakraborty
Anand Eswaran
Hima Patel
Praveen Jayachandran
50
0
0
19 Mar 2025
Speculative Decoding for Verilog: Speed and Quality, All in One
Changran Xu
Yi Liu
Yunhao Zhou
Shan Huang
Ningyi Xu
Qiang Xu
48
0
0
18 Mar 2025
Unveiling Pitfalls: Understanding Why AI-driven Code Agents Fail at GitHub Issue Resolution
Zhi Chen
Wei Ma
Lingxiao Jiang
LLMAG
51
0
0
16 Mar 2025
TFHE-Coder: Evaluating LLM-agentic Fully Homomorphic Encryption Code Generation
Mayank Kumar
J. Xue
Mengxin Zheng
Qian Lou
60
2
0
15 Mar 2025
ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning
Xinyi Wang
Jiashui Wang
Peng Chen
Jinbo Su
Yanming Liu
Long Liu
Yangdong Wang
Qiyuan Chen
Kai Yun
Chunfu Jia
42
0
0
14 Mar 2025
Commenting Higher-level Code Unit: Full Code, Reduced Code, or Hierarchical Code Summarization
Weisong Sun
Y. Zhang
J. Zhu
Z. Wang
Chunrong Fang
...
Yebo Feng
Jiangping Huang
X. Wang
Zhi Jin
Yang Liu
58
1
0
13 Mar 2025
Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning
Yuan Jiang
Yujian Zhang
Liang Lu
Christoph Treude
Xiaohong Su
Shan Huang
Tiantian Wang
ALM
61
0
0
12 Mar 2025
OASIS: Order-Augmented Strategy for Improved Code Search
Zuchen Gao
Zizheng Zhan
Xianming Li
Erxin Yu
Haotian Zhang
Bin Chen
Yuqun Zhang
Jing Li
64
0
0
11 Mar 2025
R+R: Security Vulnerability Dataset Quality Is Critical
Anurag Swarnim Yadav
Joseph N. Wilson
AAML
49
0
0
09 Mar 2025
Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs?
Qingyuan Liang
Zhao Zhang
Zeyu Sun
Zheng Lin
Qi Luo
...
Yuqun Zhang
Haotian Zhang
Lu Zhang
Bin Chen
Y. Xiong
41
1
0
07 Mar 2025
LoRACode: LoRA Adapters for Code Embeddings
Saumya Chaturvedi
Aman Chadha
Laurent Bindschaedler
61
0
0
07 Mar 2025
Trim My View: An LLM-Based Code Query System for Module Retrieval in Robotic Firmware
Sima Arasteh
Pegah Jandaghi
Nicolaas Weideman
Dennis Perepech
Mukund Raghothaman
Christophe Hauser
Luis Garcia
149
0
0
05 Mar 2025
Experiences with Content Development and Assessment Design in the Era of GenAI
Aakanksha Sharma
S. Shailendra
Rajan Kadel
31
0
0
28 Feb 2025
Learning Code-Edit Embedding to Model Student Debugging Behavior
Hasnain Heickal
Andrew Lan
51
0
0
26 Feb 2025
Deep-Bench: Deep Learning Benchmark Dataset for Code Generation
Alireza Daghighfarsoodeh
Chung-Yu Wang
Hamed Taherkhani
Melika Sepidband
Mohammad Abdollahi
Hadi Hemmati
Hung Viet Pham
ALM
ELM
96
0
0
26 Feb 2025
CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation
K. Yan
Hongcheng Guo
Xuanqing Shi
J. Xu
Yaonan Gu
Z. Li
ALM
87
0
0
26 Feb 2025
GNN-Coder: Boosting Semantic Code Retrieval with Combined GNNs and Transformer
Yufan Ye
Pu Pang
Ting Zhang
Hua Huang
66
0
0
24 Feb 2025
Code Summarization Beyond Function Level
Vladimir Makharev
Vladimir Ivanov
38
0
0
23 Feb 2025
Eliminating Backdoors in Neural Code Models for Secure Code Understanding
Weisong Sun
Yuchen Chen
Chunrong Fang
Yebo Feng
Yuan Xiao
An Guo
Quanjun Zhang
Yang Liu
Baowen Xu
Zhenyu Chen
AAML
106
1
0
21 Feb 2025
Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness
Weisong Sun
Yuchen Chen
Mengzhe Yuan
Chunrong Fang
Zhenpeng Chen
Chong Wang
Yang Liu
Baowen Xu
Zhenyu Chen
AAML
34
1
0
20 Feb 2025
Understanding and Evaluating Hallucinations in 3D Visual Language Models
Ruiying Peng
Kaiyuan Li
Weichen Zhang
Chen Gao
Xinlei Chen
Y. Li
42
0
0
18 Feb 2025
UniGenCoder: Merging Seq2Seq and Seq2Tree Paradigms for Unified Code Generation
Liangying Shao
Yanfu Yan
Denys Poshyvanyk
Jinsong Su
36
1
0
18 Feb 2025
ScriptoriumWS: A Code Generation Assistant for Weak Supervision
Tzu-Heng Huang
Catherine Cao
Spencer Schoenberg
Harit Vishwakarma
Nicholas Roberts
Frederic Sala
NoLa
134
5
0
17 Feb 2025
The Graph's Apprentice: Teaching an LLM Low Level Knowledge for Circuit Quality Estimation
Reza Moravej
Saurabh Bodhe
Zhanguang Zhang
Didier Chetelat
Dimitrios Tsaras
Yingxue Zhang
Hui-Ling Zhen
Jianye Hao
M. Yuan
53
1
0
17 Feb 2025
LeDex: Training LLMs to Better Self-Debug and Explain Code
Nan Jiang
Xiaopeng Li
Shiqi Wang
Qiang Zhou
Soneya Binta Hossain
Baishakhi Ray
Varun Kumar
Xiaofei Ma
Anoop Deoras
LRM
92
11
0
17 Feb 2025
URECA: The Chain of Two Minimum Set Cover Problems exists behind Adaptation to Shifts in Semantic Code Search
Seok-Ung Choi
Joonghyuk Hahn
Yo-Sub Han
51
0
0
11 Feb 2025