ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.06333
  4. Cited By
Unified Pre-training for Program Understanding and Generation

Unified Pre-training for Program Understanding and Generation

10 March 2021
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
ArXivPDFHTML

Papers citing "Unified Pre-training for Program Understanding and Generation"

50 / 316 papers shown
Title
Language Agnostic Code Embeddings
Language Agnostic Code Embeddings
Saiteja Utpala
Alex Gu
Pin-Yu Chen
39
1
0
25 Oct 2023
Understanding Code Semantics: An Evaluation of Transformer Models in
  Summarization
Understanding Code Semantics: An Evaluation of Transformer Models in Summarization
Debanjan Mondal
Abhilasha Lodha
Ankita Sahoo
Beena Kumari
37
0
0
25 Oct 2023
SteloCoder: a Decoder-Only LLM for Multi-Language to Python Code
  Translation
SteloCoder: a Decoder-Only LLM for Multi-Language to Python Code Translation
Jialing Pan
Adrien Sadé
Jin Kim
Eric Soriano
Guillem Sole
Sylvain Flamant
SyDa
13
16
0
24 Oct 2023
SUT: Active Defects Probing for Transcompiler Models
SUT: Active Defects Probing for Transcompiler Models
Mengnan Qi
Yufan Huang
Maoquan Wang
Yongqiang Yao
Zihan Liu
Bin Gu
Colin B. Clement
Neel Sundaresan
25
2
0
22 Oct 2023
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language
  Models with IdentityChain
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
Marcus J. Min
Yangruibo Ding
Luca Buratti
Saurabh Pujar
Gail E. Kaiser
Suman Jana
Baishakhi Ray
LRM
HILM
27
18
0
21 Oct 2023
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code
  Completion
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion
Yangruibo Ding
Zijian Wang
Wasi Uddin Ahmad
Hantian Ding
Ming Tan
...
M. K. Ramanathan
Ramesh Nallapati
Parminder Bhatia
Dan Roth
Bing Xiang
ELM
34
117
0
17 Oct 2023
Program Translation via Code Distillation
Program Translation via Code Distillation
Yufan Huang
Mengnan Qi
Yongqiang Yao
Maoquan Wang
Bin Gu
Colin B. Clement
Neel Sundaresan
23
5
0
17 Oct 2023
Functional Overlap Reranking for Neural Code Generation
Functional Overlap Reranking for Neural Code Generation
H. To
Minh Huynh Nguyen
Nghi D. Q. Bui
32
4
0
16 Oct 2023
Large Language Model-Aware In-Context Learning for Code Generation
Large Language Model-Aware In-Context Learning for Code Generation
Jia Li
Ge Li
Chongyang Tao
Jia Li
Huangzhao Zhang
Fang Liu
Zhi Jin
51
28
0
15 Oct 2023
Towards Causal Deep Learning for Vulnerability Detection
Towards Causal Deep Learning for Vulnerability Detection
Md. Mahbubur Rahman
Ira Ceka
Chengzhi Mao
Saikat Chakraborty
Baishakhi Ray
Wei Le
26
10
0
12 Oct 2023
LLM for SoC Security: A Paradigm Shift
LLM for SoC Security: A Paradigm Shift
Dipayan Saha
Shams Tarek
Katayoon Yahyaei
S. Saha
Jingbo Zhou
M. Tehranipoor
Farimah Farahmandi
63
46
0
09 Oct 2023
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code
  Translation
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation
Weixiang Yan
Yuchen Tian
Yunzhe Li
Qian Chen
Wen Wang
34
35
0
08 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
39
48
0
06 Oct 2023
A Survey of GPT-3 Family Large Language Models Including ChatGPT and
  GPT-4
A Survey of GPT-3 Family Large Language Models Including ChatGPT and GPT-4
Katikapalli Subramanyam Kalyan
LM&MA
AI4CE
LRM
AILaw
ELM
43
224
0
04 Oct 2023
Gotcha! This Model Uses My Code! Evaluating Membership Leakage Risks in
  Code Models
Gotcha! This Model Uses My Code! Evaluating Membership Leakage Risks in Code Models
Zhou Yang
Zhipeng Zhao
Chenyu Wang
Jieke Shi
Dongsum Kim
Donggyun Han
David Lo
SILM
AAML
MIACV
36
12
0
02 Oct 2023
Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API
  Names?
Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names?
Terry Yue Zhuo
Xiaoning Du
Zhenchang Xing
Jiamou Sun
Haowei Quan
Li Li
Liming Zhu
31
2
0
14 Sep 2023
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic
  Program Repair
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair
Weishi Wang
Yue Wang
Chenyu You
Steven C. H. Hoi
29
57
0
12 Sep 2023
Revisiting File Context for Source Code Summarization
Revisiting File Context for Source Code Summarization
Aakash Bansal
Chia-Yi Su
Collin McMillan
17
4
0
05 Sep 2023
A study on the impact of pre-trained model on Just-In-Time defect
  prediction
A study on the impact of pre-trained model on Just-In-Time defect prediction
Yuxiang Guo
Xiaopeng Gao
Zhenyu Zhang
William Chan
Bo Jiang
VLM
22
3
0
05 Sep 2023
Bias Testing and Mitigation in LLM-based Code Generation
Bias Testing and Mitigation in LLM-based Code Generation
Dong Huang
Qingwen Bu
Jie M. Zhang
Xiaofei Xie
Junjie Chen
Heming Cui
48
20
0
03 Sep 2023
Copiloting the Copilots: Fusing Large Language Models with Completion
  Engines for Automated Program Repair
Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair
Yuxiang Wei
Chun Xia
Lingming Zhang
KELM
33
91
0
01 Sep 2023
Code Llama: Open Foundation Models for Code
Code Llama: Open Foundation Models for Code
Baptiste Rozière
Jonas Gehring
Fabian Gloeckle
Sten Sootla
Itai Gat
...
Hugo Touvron
Louis Martin
Nicolas Usunier
Thomas Scialom
Gabriel Synnaeve
ELM
ALM
63
1,906
0
24 Aug 2023
kTrans: Knowledge-Aware Transformer for Binary Code Embedding
kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Wenyu Zhu
Hao Wang
Yuchen Zhou
Jiaming Wang
Zihan Sha
Zeyu Gao
Chao Zhang
32
10
0
24 Aug 2023
LLaMA-Reviewer: Advancing Code Review Automation with Large Language
  Models through Parameter-Efficient Fine-Tuning
LLaMA-Reviewer: Advancing Code Review Automation with Large Language Models through Parameter-Efficient Fine-Tuning
Jun Lu
Lei Yu
Xiaojia Li
Li Yang
Chun Zuo
ALM
27
72
0
22 Aug 2023
Large Language Models for Software Engineering: A Systematic Literature
  Review
Large Language Models for Software Engineering: A Systematic Literature Review
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
39
324
0
21 Aug 2023
Towards Automatically Addressing Self-Admitted Technical Debt: How Far
  Are We?
Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?
A. Mastropaolo
M. D. Penta
Gabriele Bavota
31
7
0
17 Aug 2023
Evaluating and Explaining Large Language Models for Code Using Syntactic
  Structures
Evaluating and Explaining Large Language Models for Code Using Syntactic Structures
David Nader-Palacio
Alejandro Velasco
Daniel Rodríguez-Cárdenas
Kevin Moran
Denys Poshyvanyk
34
8
0
07 Aug 2023
Exploiting Code Symmetries for Learning Program Semantics
Exploiting Code Symmetries for Learning Program Semantics
Kexin Pei
Weichen Li
Qirui Jin
Shuyang Liu
Scott Geng
Lorenzo Cavallaro
Junfeng Yang
Suman Jana
41
4
0
07 Aug 2023
An Empirical Study of AI-based Smart Contract Creation
An Empirical Study of AI-based Smart Contract Creation
Rabimba Karanjai
Edward Li
Lei Xu
W. Shi
18
9
0
05 Aug 2023
Evaluating Instruction-Tuned Large Language Models on Code Comprehension
  and Generation
Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation
Zhiqiang Yuan
Junwei Liu
Qiancheng Zi
Mingwei Liu
Xin Peng
Yiling Lou
ALM
ELM
LRM
17
73
0
02 Aug 2023
CodeBPE: Investigating Subtokenization Options for Large Language Model
  Pretraining on Source Code
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code
Nadezhda Chirkova
Sergey Troshin
21
8
0
01 Aug 2023
LaFiCMIL: Rethinking Large File Classification from the Perspective of
  Correlated Multiple Instance Learning
LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning
Tiezhu Sun
Weiguo Pian
N. Daoudi
Kevin Allix
Tegawende F. Bissyande
Jacques Klein
31
1
0
30 Jul 2023
Multilingual Code Co-Evolution Using Large Language Models
Multilingual Code Co-Evolution Using Large Language Models
Jiyang Zhang
Pengyu Nie
Junyi Jessy Li
Miloš Gligorić
32
20
0
27 Jul 2023
RLTF: Reinforcement Learning from Unit Test Feedback
RLTF: Reinforcement Learning from Unit Test Feedback
Jiate Liu
Yiqin Zhu
Kaiwen Xiao
Qiang Fu
Xiao Han
Wei Yang
Deheng Ye
OffRL
52
56
0
10 Jul 2023
An Exploratory Literature Study on Sharing and Energy Use of Language
  Models for Source Code
An Exploratory Literature Study on Sharing and Energy Use of Language Models for Source Code
Max Hort
Anastasiia Grishina
Leon Moonen
18
2
0
05 Jul 2023
Exploring Continual Learning for Code Generation Models
Exploring Continual Learning for Code Generation Models
Prateek Yadav
Q. Sun
Hantian Ding
Xiaopeng Li
Dejiao Zhang
...
Parminder Bhatia
Ramesh Nallapati
M. K. Ramanathan
Joey Tianyi Zhou
Bing Xiang
CLL
37
30
0
05 Jul 2023
Natural Language Generation and Understanding of Big Code for
  AI-Assisted Programming: A Review
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
42
78
0
04 Jul 2023
Uncovering the Limits of Machine Learning for Automatic Vulnerability
  Detection
Uncovering the Limits of Machine Learning for Automatic Vulnerability Detection
Niklas Risse
Marcel Bohme
AAML
68
22
0
28 Jun 2023
Constructing Multilingual Code Search Dataset Using Neural Machine
  Translation
Constructing Multilingual Code Search Dataset Using Neural Machine Translation
Ryo Sekizawa
Nan Duan
Shuai Lu
Hitomi Yanaka
13
2
0
27 Jun 2023
Exploring the Robustness of Large Language Models for Solving
  Programming Problems
Exploring the Robustness of Large Language Models for Solving Programming Problems
Atsushi Shirafuji
Yutaka Watanobe
Takumi Ito
Makoto Morishita
Yuki Nakamura
Yusuke Oda
Jun Suzuki
ELM
41
18
0
26 Jun 2023
Guiding Language Models of Code with Global Context using Monitors
Guiding Language Models of Code with Global Context using Monitors
Lakshya A Agrawal
Aditya Kanade
Navin Goyal
Shuvendu K. Lahiri
S. Rajamani
40
23
0
19 Jun 2023
Multi-target Backdoor Attacks for Code Pre-trained Models
Multi-target Backdoor Attacks for Code Pre-trained Models
Yanzhou Li
Shangqing Liu
Kangjie Chen
Xiaofei Xie
Tianwei Zhang
Yang Liu
AAML
SILM
22
23
0
14 Jun 2023
CoTran: An LLM-based Code Translator using Reinforcement Learning with
  Feedback from Compiler and Symbolic Execution
CoTran: An LLM-based Code Translator using Reinforcement Learning with Feedback from Compiler and Symbolic Execution
Prithwish Jana
Piyush Jha
Haoyang Ju
Gautham Kishore
Aryan Mahajan
Vijay Ganesh
24
12
0
11 Jun 2023
A Comprehensive Review of State-of-The-Art Methods for Java Code
  Generation from Natural Language Text
A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text
Jessica Nayeli López Espejel
Mahaman Sanoussi Yahaya Alassan
El Mehdi Chouham
Walid Dahhane
E. Ettifouri
21
13
0
10 Jun 2023
Large Language Models of Code Fail at Completing Code with Potential
  Bugs
Large Language Models of Code Fail at Completing Code with Potential Bugs
Tuan Dinh
Jinman Zhao
Samson Tan
Renato M. P. Negrinho
Leonard Lausen
Sheng Zha
George Karypis
LRM
37
25
0
06 Jun 2023
A Static Evaluation of Code Completion by Large Language Models
A Static Evaluation of Code Completion by Large Language Models
Hantian Ding
Varun Kumar
Yuchen Tian
Zijian Wang
Robert Kwiatkowski
...
Baishakhi Ray
Parminder Bhatia
Sudipta Sengupta
Dan Roth
Bing Xiang
ELM
ALM
35
14
0
05 Jun 2023
SourceP: Detecting Ponzi Schemes on Ethereum with Source Code
SourceP: Detecting Ponzi Schemes on Ethereum with Source Code
Pengcheng Lu
Liang Cai
Keting Yin
AI4TS
17
4
0
02 Jun 2023
DSHGT: Dual-Supervisors Heterogeneous Graph Transformer -- A pioneer
  study of using heterogeneous graph learning for detecting software
  vulnerabilities
DSHGT: Dual-Supervisors Heterogeneous Graph Transformer -- A pioneer study of using heterogeneous graph learning for detecting software vulnerabilities
Tiehua Zhang
Ruiqian Xu
Jianping Zhang
Yuze Liu
Xin Chen
Jun-Jian Yin
Xi Zheng
19
2
0
02 Jun 2023
Do Large Language Models Pay Similar Attention Like Human Programmers
  When Generating Code?
Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code?
Bonan Kou
Shengmai Chen
Zhijie Wang
Lei Ma
Tianyi Zhang
ALM
11
13
0
02 Jun 2023
Better Context Makes Better Code Language Models: A Case Study on
  Function Call Argument Completion
Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion
Hengzhi Pei
Jinman Zhao
Leonard Lausen
Sheng Zha
George Karypis
ELM
LRM
14
21
0
01 Jun 2023
Previous
1234567
Next