arXiv:2203.03850
UniXcoder: Unified Cross-Modal Pre-training for Code Representation
8 March 2022
Daya Guo
Shuai Lu
Nan Duan
Yanlin Wang
Ming Zhou
Jian Yin
Papers citing "UniXcoder: Unified Cross-Modal Pre-training for Code Representation"
50 / 200 papers shown
Title
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Dũng Nguyễn Mạnh
Nam Le Hai
An Dau
A. Nguyen
Khanh N. Nghiem
Jingnan Guo
Nghi D. Q. Bui
31
15
0
09 May 2023
Code Execution with Pre-trained Language Models
Chenxiao Liu
Shuai Lu
Weizhu Chen
Daxin Jiang
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
Nan Duan
ELM
22
21
0
08 May 2023
On the Usage of Continual Learning for Out-of-Distribution Generalization in Pre-trained Language Models of Code
Martin Weyssow
Xin Zhou
Kisub Kim
David Lo
H. Sahraoui
CLL
KELM
27
10
0
06 May 2023
REINFOREST: Reinforcing Semantic Code Similarity for Cross-Lingual Code Search Models
Anthony Saieva
Saikat Chakraborty
Gail E. Kaiser
28
1
0
05 May 2023
Redundancy and Concept Analysis for Code-trained Language Models
Arushi Sharma
Zefu Hu
Christopher Quinn
Ali Jannesari
73
1
0
01 May 2023
Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation
Krishnam Hasija
Shrishti Pradhan
Manasi S. Patwardhan
Raveendra Kumar Medicherla
L. Vig
Ravindra Naik
23
2
0
26 Apr 2023
Enriching Source Code with Contextual Data for Code Completion Models: An Empirical Study
Tim van Dam
M. Izadi
A. van Deursen
20
13
0
24 Apr 2023
CodeKGC: Code Language Model for Generative Knowledge Graph Construction
Zhen Bi
Jing Chen
Yinuo Jiang
Feiyu Xiong
Wei Guo
Huajun Chen
Ningyu Zhang
11
36
0
18 Apr 2023
An Unbiased Transformer Source Code Learning with Semantic Vulnerability Graph
Nafis Tanveer Islam
G. Parra
Dylan Manuel
E. Bou-Harb
Peyman Najafirad
17
8
0
17 Apr 2023
Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
Ensheng Shi
Yanlin Wang
Hongyu Zhang
Lun Du
Shi Han
Dongmei Zhang
Hongbin Sun
33
42
0
11 Apr 2023
ChatPipe: Orchestrating Data Preparation Program by Optimizing Human-ChatGPT Interactions
Sibei Chen
Han-Chih Liu
Weiting Jin
Xiangyu Sun
Xiaoyao Feng
Ju Fan
Xiaoyong Du
Nan Tang
11
3
0
07 Apr 2023
Better Language Models of Code through Self-Improvement
H. To
Nghi D. Q. Bui
Jingnan Guo
T. Nguyen
SyDa
39
15
0
02 Apr 2023
One Adapter for All Programming Languages? Adapter Tuning for Code Search and Summarization
Deze Wang
Boxing Chen
Shanshan Li
Wei Luo
Shaoliang Peng
Wei Dong
Xiang-ke Liao
33
37
0
28 Mar 2023
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
Fengji Zhang
B. Chen
Yue Zhang
Jacky Keung
Jin Liu
Daoguang Zan
Yi Mao
Jian-Guang Lou
Weizhu Chen
27
219
0
22 Mar 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq R. Joty
25
80
0
20 Mar 2023
Knowledge Transfer for Pseudo-code Generation from Low Resource Programming Language
Ankita Sontakke
Kanika Kalra
Manasi S. Patwardhan
L. Vig
Raveendra Kumar Medicherla
Ravindra Naik
Shrishti Pradhan
12
2
0
16 Mar 2023
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Mohammad Abdullah Matin Khan
M Saiful Bari
Xuan Long Do
Weishi Wang
Md. Rizwan Parvez
Shafiq R. Joty
ALM
ELM
34
14
0
06 Mar 2023
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models
Changan Niu
Chuanyi Li
Vincent Ng
Bin Luo
ELM
ALM
34
9
0
08 Feb 2023
Exploring Data Augmentation for Code Generation Tasks
Pinzhen Chen
Gerasimos Lampouras
31
9
0
05 Feb 2023
Generation-Augmented Query Expansion For Code Retrieval
Dong Li
Yelong Shen
Ruoming Jin
Yi Mao
Kuan-Chieh Jackson Wang
Weizhu Chen
RALM
26
8
0
20 Dec 2022
A Survey on Pretrained Language Models for Neural Code Intelligence
Yichen Xu
Yanqiao Zhu
9
17
0
20 Dec 2022
Unveiling Code Pre-Trained Models: Investigating Syntax and Semantics Capacities
Wei Ma
Shangqing Liu
Mengjie Zhao
Xiaofei Xie
Wenhan Wang
Q. Hu
Jiexin Zhang
Yang Liu
27
16
0
20 Dec 2022
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
Yangruibo Ding
Zijian Wang
Wasi Uddin Ahmad
M. K. Ramanathan
Ramesh Nallapati
Parminder Bhatia
Dan Roth
Bing Xiang
21
68
0
20 Dec 2022
Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection
Benjamin Steenhoek
Hongyang Gao
Wei Le
43
27
0
15 Dec 2022
Who Evaluates the Evaluators? On Automatic Metrics for Assessing AI-based Offensive Code Generators
Pietro Liguori
Cristina Improta
R. Natella
B. Cukic
Domenico Cotroneo
ELM
28
16
0
12 Dec 2022
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
23
1
0
12 Dec 2022
DeepVulSeeker: A Novel Vulnerability Identification Framework via Code Graph Structure and Pre-training Mechanism
Jin Wang
Hui Xiao
Shuwen Zhong
Yinhao Xiao
34
11
0
23 Nov 2022
Syntax-Aware On-the-Fly Code Completion
Wannita Takerngsaksiri
C. Tantithamthavorn
Yuankui Li
24
17
0
09 Nov 2022
CodePAD: Sequence-based Code Generation with Pushdown Automaton
Yihong Dong
Xue Jiang
Yuchen Liu
Ge Li
Zhi Jin
20
6
0
02 Nov 2022
Global Contrastive Batch Sampling via Optimization on Sample Permutations
Vin Sachidananda
Ziyi Yang
Chenguang Zhu
16
5
0
23 Oct 2022
Exploring Representation-Level Augmentation for Code Search
Haochen Li
Chun Miao
Cyril Leung
Yanxian Huang
Yuan Huang
Hongyu Zhang
Yanlin Wang
45
19
0
21 Oct 2022
Soft-Labeled Contrastive Pre-training for Function-level Code Representation
Xiaonan Li
Daya Guo
Yeyun Gong
Yun Lin
Yelong Shen
Xipeng Qiu
Daxin Jiang
Weizhu Chen
Nan Duan
31
17
0
18 Oct 2022
Leveraging Artificial Intelligence on Binary Code Comprehension
Yifan Zhang
29
3
0
11 Oct 2022
Pre-Training Representations of Binary Code Using Contrastive Learning
Yifan Zhang
Chen Huang
Yueke Zhang
Kevin Cao
Scott Thomas Andersen
Huajie Shao
Kevin Leach
Yu Huang
47
3
0
11 Oct 2022
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
Nuo Chen
Qiushi Sun
Renyu Zhu
Xiang Li
Xuesong Lu
Ming Gao
38
10
0
07 Oct 2022
ContraCLM: Contrastive Learning For Causal Language Model
Nihal Jain
Dejiao Zhang
Wasi Uddin Ahmad
Zijian Wang
Feng Nan
...
Ramesh Nallapati
Baishakhi Ray
Parminder Bhatia
Xiaofei Ma
Bing Xiang
23
16
0
03 Oct 2022
Diverse Title Generation for Stack Overflow Posts with Multiple Sampling Enhanced Transformer
Fengji Zhang
Jin Liu
Yao Wan
Xiao Yu
Xiao Liu
J. Keung
88
11
0
24 Aug 2022
Learning Program Representations with a Tree-Structured Transformer
Wenhan Wang
Kechi Zhang
Ge Li
Shangqing Liu
Anran Li
Zhi Jin
Yang Liu
39
5
0
18 Aug 2022
CommitBART: A Large Pre-trained Model for GitHub Commits
Shangqing Liu
Yanzhou Li
Xiaofei Xie
Yang Liu
VLM
AI4TS
23
18
0
17 Aug 2022
CoditT5: Pretraining for Source Code and Natural Language Editing
Jiyang Zhang
Sheena Panthaplackel
Pengyu Nie
Junyi Jessy Li
Miloš Gligorić
KELM
17
88
0
10 Aug 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling
Fenia Christopoulou
Gerasimos Lampouras
Milan Gritta
Guchun Zhang
Yinpeng Guo
...
Guangtai Liang
Jia Wei
Xin Jiang
Qianxiang Wang
Qun Liu
ELM
SyDa
ALM
45
74
0
22 Jul 2022
NS3: Neuro-Symbolic Semantic Code Search
Shushan Arakelyan
Anna Hakhverdyan
Miltiadis Allamanis
Luis Garcia
Christophe Hauser
Xiang Ren
81
9
0
21 May 2022
Addressing Leakage in Self-Supervised Contextualized Code Retrieval
Johannes Villmow
Viola Campos
A. Ulges
Ulrich Schwanecke
27
3
0
17 Apr 2022
CoCoSoDa: Effective Contrastive Learning for Code Search
Ensheng Shi
Yanlin Wang
Wenchao Gu
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
38
33
0
07 Apr 2022
Automating Code Review Activities by Large-Scale Pre-training
Zhiyu Li
Shuai Lu
Daya Guo
Nan Duan
Shailesh Jannu
...
Deep Majumder
Jared Green
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
VLM
23
138
0
17 Mar 2022
ReACC: A Retrieval-Augmented Code Completion Framework
Shuai Lu
Nan Duan
Hojae Han
Daya Guo
Seung-won Hwang
Alexey Svyatkovskiy
25
139
0
15 Mar 2022
CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search
Xiaonan Li
Yeyun Gong
Yelong Shen
Xipeng Qiu
Hang Zhang
Bolun Yao
Weizhen Qi
Daxin Jiang
Weizhu Chen
Nan Duan
OffRL
37
27
0
26 Jan 2022
Unveiling Project-Specific Bias in Neural Code Models
Zhiming Li
Yanzhou Li
Tianlin Li
Mengnan Du
Bozhi Wu
Yushi Cao
Yi Li
Yang Liu
31
5
0
19 Jan 2022
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
235
1,489
0
02 Sep 2021
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Shuai Lu
Daya Guo
Shuo Ren
Junjie Huang
Alexey Svyatkovskiy
...
Nan Duan
Neel Sundaresan
Shao Kun Deng
Shengyu Fu
Shujie Liu
ELM
198
1,105
0
09 Feb 2021