2403.16702
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
25 March 2024
Zehan Li
Jianfei Zhang
Chuantao Yin
Y. Ouyang
Wenge Rong
Papers citing "ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search"
Towards Learning (Dis)-Similarity of Source Code from Program Contrasts
Yangruibo Ding
Luca Buratti
Saurabh Pujar
Alessandro Morari
Baishakhi Ray
Saikat Chakraborty
53
36
0
08 Oct 2021
CodeQA: A Question Answering Dataset for Source Code Comprehension
Chenxiao Liu
Xiaojun Wan
81
29
0
17 Sep 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq Joty
Guosheng Lin
289
1,560
0
02 Sep 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
107
114
0
10 Aug 2021
CoSQA: 20,000+ Web Queries for Code Search and Question Answering
Junjie Huang
Duyu Tang
Linjun Shou
Ming Gong
Ke Xu
Daxin Jiang
Ming Zhou
Nan Duan
69
116
0
27 May 2021
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
Ruchi Puri
David S. Kung
G. Janssen
Wei Zhang
Giacomo Domeniconi
...
Saurabh Pujar
Shyam Ramji
Ulrich Finkler
Susan Malaika
Frederick Reiss
79
239
0
25 May 2021
Unified Pre-training for Program Understanding and Generation
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
135
769
0
10 Mar 2021
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
...
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
145
1,143
0
17 Sep 2020
Neural Code Search Revisited: Enhancing Code Snippet Retrieval through Natural Language Intent
Geert Heyman
Tom Van Cutsem
55
30
0
27 Aug 2020
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
...
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
165
2,637
0
19 Feb 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
445
20,298
0
23 Oct 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
668
24,528
0
26 Jul 2019
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
Tao Yu
Rui Zhang
Kai-Chou Yang
Michihiro Yasunaga
Dongxu Wang
...
Irene Li
Qingning Yao
Shanelle Roman
Zilin Zhang
Dragomir R. Radev
RALM
108
1,241
0
24 Sep 2018
Learning to Mine Aligned Code and Natural Language Pairs from Stack Overflow
Pengcheng Yin
Bowen Deng
Edgar Chen
Bogdan Vasilescu
Graham Neubig
63
304
0
23 May 2018