2505.24715
CoRet: Improved Retriever for Code Editing
30 May 2025
Fabio Fehr
Prabhu Teja Sivaprasad
Luca Franceschi
Giovanni Zappella
Papers citing
"CoRet: Improved Retriever for Code Editing"
36 / 36 papers shown
Title
LLMs are Also Effective Embedding Models: An In-depth Overview
Chongyang Tao
Tao Shen
Shen Gao
Junshuo Zhang
Zhen Li
Zhengwei Tao
Shuai Ma
107
9
0
17 Dec 2024
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations
Jia Li
Ge Li
Xuanming Zhang
Yunfei Zhao
Yihong Dong
Zhi Jin
Binhua Li
Fei Huang
Yongbin Li
ALM
ELM
87
15
0
30 Oct 2024
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
Siru Ouyang
Wenhao Yu
Kaixin Ma
Zilin Xiao
Zizhuo Zhang
Mengzhao Jia
Jiawei Han
Han Zhang
Dong Yu
77
15
0
03 Oct 2024
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models
Michael Günther
Isabelle Mohr
Daniel James Williams
Bo Wang
Han Xiao
48
12
0
07 Sep 2024
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Xiangyan Liu
Bo Lan
Zhiyuan Hu
Yang Liu
Zhicheng Zhang
Fei Wang
Michael Shieh
Ang Wang
58
17
0
07 Aug 2024
SpecRover: Code Intent Extraction via LLMs
Haifeng Ruan
Yuntong Zhang
Abhik Roychoudhury
47
20
0
05 Aug 2024
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models
Xiangyang Li
Kuicai Dong
Yi Quan Lee
Wei Xia
Yichun Yin
Xinyi Dai
Yasheng Wang
Ruiming Tang
120
16
0
03 Jul 2024
CodeRAG-Bench: Can Retrieval Augment Code Generation?
Zora Z. Wang
Akari Asai
Xinyan Velocity Yu
Frank F. Xu
Yiqing Xie
Graham Neubig
Daniel Fried
RALM
130
34
0
20 Jun 2024
Long Code Arena: a Set of Benchmarks for Long-Context Code Models
Egor Bogomolov
Aleksandra V. Eliseeva
Timur Galimzyanov
Evgeniy Glukhov
Anton Shapkin
...
Yaroslav Golubev
Alexander Kovrigin
Arie van Deursen
Maliheh Izadi
T. Bryksin
ELM
38
21
0
17 Jun 2024
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Parishad BehnamGhader
Vaibhav Adlakha
Marius Mosbach
Dzmitry Bahdanau
Nicolas Chapados
Siva Reddy
67
203
0
09 Apr 2024
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Daya Guo
Qihao Zhu
Dejian Yang
Zhenda Xie
Kai Dong
...
Yu-Huan Wu
Yiming Li
Fuli Luo
Yingfei Xiong
W. Liang
ELM
85
735
0
25 Jan 2024
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
Carlos E. Jimenez
John Yang
Alexander Wettig
Shunyu Yao
Kexin Pei
Ofir Press
Karthik Narasimhan
ELM
52
529
0
10 Oct 2023
Large Language Models for Information Retrieval: A Survey
Yutao Zhu
Huaying Yuan
Shuting Wang
Jiongnan Liu
Wenhan Liu
Chenlong Deng
Haonan Chen
Zhicheng Dou
Ji-Rong Wen
KELM
82
300
0
14 Aug 2023
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
Fengji Zhang
B. Chen
Yue Zhang
Jacky Keung
Jin Liu
Daoguang Zan
Yi Mao
Jian-Guang Lou
Weizhu Chen
49
228
0
22 Mar 2023
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
Yangruibo Ding
Zijian Wang
Wasi Uddin Ahmad
M. K. Ramanathan
Ramesh Nallapati
Parminder Bhatia
Dan Roth
Bing Xiang
54
71
0
20 Dec 2022
UniXcoder: Unified Cross-Modal Pre-training for Code Representation
Daya Guo
Shuai Lu
Nan Duan
Yanlin Wang
Ming Zhou
Jian Yin
70
574
0
08 Mar 2022
Unsupervised Dense Information Retrieval with Contrastive Learning
Gautier Izacard
Mathilde Caron
Lucas Hosseini
Sebastian Riedel
Piotr Bojanowski
Armand Joulin
Edouard Grave
RALM
116
864
0
16 Dec 2021
Improving language models by retrieving from trillions of tokens
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
...
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
160
1,069
0
08 Dec 2021
CoSQA: 20,000+ Web Queries for Code Search and Question Answering
Junjie Huang
Duyu Tang
Linjun Shou
Ming Gong
Ke Xu
Daxin Jiang
Ming Zhou
Nan Duan
46
114
0
27 May 2021
Project-Level Encoding for Neural Source Code Summarization of Subroutines
Aakash Bansal
S. Haque
Collin McMillan
78
49
0
22 Mar 2021
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
...
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
128
1,111
0
17 Sep 2020
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
Lee Xiong
Chenyan Xiong
Ye Li
Kwok-Fung Tang
Jialin Liu
Paul N. Bennett
Junaid Ahmed
Arnold Overwijk
97
1,207
0
01 Jul 2020
Pre-training via Paraphrasing
M. Lewis
Marjan Ghazvininejad
Gargi Ghosh
Armen Aghajanyan
Sida I. Wang
Luke Zettlemoyer
AIMat
76
160
0
26 Jun 2020
Sparse, Dense, and Attentional Representations for Text Retrieval
Y. Luan
Jacob Eisenstein
Kristina Toutanova
M. Collins
57
402
0
01 May 2020
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
Omar Khattab
Matei A. Zaharia
97
1,337
0
27 Apr 2020
Dense Passage Retrieval for Open-Domain Question Answering
Vladimir Karpukhin
Barlas Oğuz
Sewon Min
Patrick Lewis
Ledell Yu Wu
Sergey Edunov
Danqi Chen
Wen-tau Yih
RALM
137
3,676
0
10 Apr 2020
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
...
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
132
2,588
0
19 Feb 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
255
18,607
0
13 Feb 2020
Learning and Evaluating Contextual Embedding of Source Code
Aditya Kanade
Petros Maniatis
Gogul Balakrishnan
Kensen Shi
ELM
57
77
0
21 Dec 2019
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
Hamel Husain
Ho-Hsiang Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
114
1,062
0
20 Sep 2019
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
641
11,979
0
27 Aug 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
961
93,936
0
11 Oct 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
229
10,152
0
10 Jul 2018
Predicting Positive and Negative Links with Noisy Queries: Theory & Practice
Charalampos E. Tsourakakis
Michael Mitzenmacher
Kasper Green Larsen
Jarosław Błasiok
Benn Lawson
Preetum Nakkiran
Vasileios Nakos
35
22
0
19 Sep 2017
Prediction and Clustering in Signed Networks: A Local to Global Perspective
Kai-Yang Chiang
Cho-Jui Hsieh
Nagarajan Natarajan
Ambuj Tewari
Inderjit S. Dhillon
61
145
0
20 Feb 2013
A Correlation Clustering Approach to Link Classification in Signed Networks -- Full Version --
Nicolò Cesa-Bianchi
Claudio Gentile
Fabio Vitale
Giovanni Zappella
62
33
0
21 Jan 2013