CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation

14 June 2022
Daoguang Zan
Bei Chen
Dejian Yang
Zeqi Lin
Minsu Kim
Bei Guan
Yongji Wang
Weizhu Chen
Jian-Guang Lou

Papers citing "CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation"

50 / 72 papers shown
Towards an Understanding of Context Utilization in Code Intelligence
Yanlin Wang
Kefeng Duan
Dewu Zheng
Ensheng Shi
F. Zhang
...
Xilin Liu
Yuchi Ma
Hongyu Zhang
Qianxiang Wang
Zibin Zheng
29
0
0
11 Apr 2025
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
Daoguang Zan
Zhirong Huang
Wei Liu
Hanwu Chen
L. Zhang
...
Jing Su
Tianyu Liu
Rui Long
Kai Shen
Liang Xiang
43
2
0
03 Apr 2025
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
Indraneil Paul
Haoyi Yang
Goran Glavas
Kristian Kersting
Iryna Gurevych
AAML
SyDa
43
0
0
27 Mar 2025
Large Language Models (LLMs) for Source Code Analysis: applications, models and datasets
Hamed Jelodar
Mohammad Meymani
Roozbeh Razavi-Far
42
0
0
21 Mar 2025
LLMs Love Python: A Study of LLMs' Bias for Programming Languages and Libraries
Lukas Twist
Jie M. Zhang
Mark Harman
Don Syme
Joost Noppen
Detlef Nauck
50
0
0
21 Mar 2025
Fully Autonomous Programming using Iterative Multi-Agent Debugging with Large Language Models
Anastasiia Grishina
Vadim Liventsev
Aki Härmä
Leon Moonen
ELM
87
0
0
10 Mar 2025
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Roham Koohestani
Philippe de Bekker
M. Izadi
VLM
45
0
0
07 Mar 2025
Alchemist: Towards the Design of Efficient Online Continual Learning System
Yuyang Huang
Yuhan Liu
Haryadi S. Gunawi
Beibin Li
Changho Hwang
CLL
OnRL
101
0
0
03 Mar 2025
LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks
Xin Zhou
Martin Weyssow
Ratnadira Widyasari
Ting Zhang
Junda He
Yunbo Lyu
Jianming Chang
Beiqi Zhang
Dan Huang
David Lo
PILM
277
1
0
10 Feb 2025
How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs
Jialun Cao
Yuk-Kit Chan
Zixuan Ling
Wenxuan Wang
Shuqing Li
...
Pinjia He
Shuai Wang
Zibin Zheng
Michael R. Lyu
Shing-Chi Cheung
ALM
71
1
0
18 Jan 2025
A Survey on Time-Series Distance Measures
John Paparrizos
Haojun Li
Fan Yang
Kaize Wu
Jens E. d'Hondt
Odysseas Papapetrou
AI4TS
29
0
0
31 Dec 2024
TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models
Saipraneeth Devunuri
Lewis J. Lehe
LM&MA
73
1
0
07 Dec 2024
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models
Nizar Islah
Justine Gehring
Diganta Misra
Eilif B. Muller
Irina Rish
Terry Yue Zhuo
Massimo Caccia
SyDa
40
1
0
05 Nov 2024
Metamorphic Malware Evolution: The Potential and Peril of Large Language Models
Pooria Madani
42
5
0
31 Oct 2024
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations
Jia Li
Ge Li
Xuanming Zhang
Yunfei Zhao
Yihong Dong
Zhi Jin
Binhua Li
Fei Huang
Yongbin Li
ALM
ELM
41
11
0
30 Oct 2024
Faster Language Models with Better Multi-Token Prediction Using Tensor Decomposition
Artem Basharin
Andrei Chertkov
Ivan V. Oseledets
42
1
0
23 Oct 2024
Building A Coding Assistant via the Retrieval-Augmented Language Model
Xinze Li
Hanbin Wang
Zhenghao Liu
S. Yu
Shuo Wang
Yukun Yan
Yukai Fu
Yu Gu
Ge Yu
3DV
RALM
23
2
0
21 Oct 2024
Evaluation of Code LLMs on Geospatial Code Generation
Piotr Gramacki
Bruno Martins
Piotr Szymański
29
2
0
06 Oct 2024
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
Zhenyu Pan
Rongyu Cao
Yongchang Cao
Yingwei Ma
Binhua Li
Fei Huang
Han Liu
Yongbin Li
45
4
0
02 Oct 2024
A Comprehensive Framework for Evaluating API-oriented Code Generation in Large Language Models
Yixi Wu
Pengfei He
Zehao Wang
Shaowei Wang
Yuan Tian
Tse-Hsun Chen
ALM
37
0
0
23 Sep 2024
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java
Daoguang Zan
Zhirong Huang
Ailun Yu
Shaoxin Lin
Yifan Shi
...
Bei Guan
Pengjie Huang
Tao Xie
Yongji Wang
Qianxiang Wang
31
8
0
26 Aug 2024
DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation
Qiming Zhu
Jialun Cao
Yaojie Lu
Hongyu Lin
Xianpei Han
Le Sun
Shing-Chi Cheung
ALM
35
7
0
23 Aug 2024
CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?
Yuwei Zhao
Ziyang Luo
Yuchen Tian
Hongzhan Lin
Weixiang Yan
Annan Li
Jing Ma
ELM
ALM
LRM
50
8
0
20 Aug 2024
COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis
Weiqing Yang
Hanbin Wang
Zhenghao Liu
Xinze Li
Yukun Yan
Shuo Wang
Yu Gu
Minghe Yu
Zhiyuan Liu
Ge Yu
50
2
0
09 Aug 2024
Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models
Jupinder Parmar
Sanjev Satheesh
M. Patwary
M. Shoeybi
Bryan Catanzaro
50
28
0
09 Jul 2024
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Shihan Dou
Haoxiang Jia
Shenxi Wu
Huiyuan Zheng
Weikang Zhou
...
Xunliang Cai
Tao Gui
Xipeng Qiu
Qi Zhang
Xuanjing Huang
31
32
0
08 Jul 2024
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Gaurav Sahu
Abhay Puri
Juan A. Rodriguez
Alexandre Drouin
Perouz Taslakian
...
Christopher Pal
Nicolas Chapados
I. Laradji
Sai Rajeswar Mudumba
Issam Hadj Laradji
ELM
46
4
0
08 Jul 2024
Is Your AI-Generated Code Really Safe? Evaluating Large Language Models on Secure Code Generation with CodeSecEval
Jiexin Wang
Xitong Luo
Liuwen Cao
Hongkui He
Hailin Huang
Jiayuan Xie
Adam Jatowt
Yi Cai
ELM
38
14
0
02 Jul 2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Terry Yue Zhuo
Minh Chien Vu
Jenny Chim
Han Hu
Wenhao Yu
...
David Lo
Daniel Fried
Xiaoning Du
H. D. Vries
Leandro von Werra
77
131
0
22 Jun 2024
Towards Lifelong Learning of Large Language Models: A Survey
Junhao Zheng
Shengjie Qiu
Chengming Shi
Qianli Ma
KELM
CLL
30
14
0
10 Jun 2024
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
53
161
0
01 Jun 2024
DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
Jia Li
Ge Li
Yunfei Zhao
Yongming Li
Huanyu Liu
...
Yihong Dong
Zhi Jin
Binhua Li
Fei Huang
Yongbin Li
ALM
29
26
0
30 May 2024
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Jianbo Dai
Jianqiao Lu
Yunlong Feng
Rongju Ruan
Ming Cheng
Haochen Tan
Zhijiang Guo
ELM
LRM
36
12
0
19 May 2024
Automatic Programming: Large Language Models and Beyond
Michael R. Lyu
Baishakhi Ray
Abhik Roychoudhury
Shin Hwei Tan
Patanamon Thongtanunam
33
15
0
03 May 2024
EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories
Jia Li
Ge Li
Xuanming Zhang
Yihong Dong
Zhi Jin
32
32
0
31 Mar 2024
CodeS: Natural Language to Code Repository via Multi-Layer Sketch
Daoguang Zan
Ailun Yu
Wei Liu
Dong Chen
Bo Shen
...
Bei Guan
Zhiguang Yang
Yongji Wang
Qianxiang Wang
Li-zhen Cui
33
14
0
25 Mar 2024
Simple and Scalable Strategies to Continually Pre-train Large Language Models
Adam Ibrahim
Benjamin Thérien
Kshitij Gupta
Mats L. Richter
Quentin Anthony
Timothée Lesort
Eugene Belilovsky
Irina Rish
KELM
CLL
44
52
0
13 Mar 2024
SEED: Customize Large Language Models with Sample-Efficient Adaptation for Code Generation
Xue Jiang
Yihong Dong
Zhi Jin
Ge Li
VLM
44
4
0
29 Feb 2024
Benchmarking Data Science Agents
Yuge Zhang
Qiyang Jiang
Xingyu Han
Nan Chen
Yuqing Yang
Kan Ren
ELM
27
10
0
27 Feb 2024
Solving Data-centric Tasks using Large Language Models
Shraddha Barke
Christian Poelitz
Carina Negreanu
Ben Zorn
J. Cambronero
...
Nadia Polikarpova
Advait Sarkar
Brian Slininger
N. Toronto
Jack Williams
38
1
0
18 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
18
1
0
08 Feb 2024
EffiBench: Benchmarking the Efficiency of Automatically Generated Code
Dong Huang
Yuhao Qing
Weiyi Shang
Heming Cui
Jie M. Zhang
82
31
0
03 Feb 2024
Continual Learning for Large Language Models: A Survey
Tongtong Wu
Linhao Luo
Yuan-Fang Li
Shirui Pan
Thuy-Trang Vu
Gholamreza Haffari
CLL
LRM
KELM
26
102
0
02 Feb 2024
Using LLM such as ChatGPT for Designing and Implementing a RISC Processor: Execution, Challenges and Limitations
S. Hossain
Aayush Gohil
Yizhou Wang
22
3
0
18 Jan 2024
OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models
Shuai Wang
Liang Ding
Li Shen
Yong Luo
Bo Du
Dacheng Tao
ELM
ALM
45
2
0
12 Jan 2024
DevEval: Evaluating Code Generation in Practical Software Projects
Jia Li
Ge Li
Yunfei Zhao
Yongming Li
Zhi Jin
...
Xuanming Zhang
Yihong Dong
Yuqi Zhu
Bin Gu
Mengfei Yang
ALM
ELM
32
11
0
12 Jan 2024
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation
Dong Huang
Jie M. Zhang
Michael Luck
Qi Bu
Yuhao Qing
Heming Cui
LLMAG
25
0
0
20 Dec 2023
Traces of Memorisation in Large Language Models for Code
Ali Al-Kaswan
M. Izadi
A. van Deursen
ELM
31
14
0
18 Dec 2023
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Daniela Ben-David
Tzuf Paz-Argaman
Reut Tsarfaty
MoE
23
0
0
25 Oct 2023
Enhancing Large Language Models for Secure Code Generation: A Dataset-driven Study on Vulnerability Mitigation
Jiexin Wang
Liuwen Cao
Xitong Luo
Zhiping Zhou
Jiayuan Xie
Adam Jatowt
Yi Cai
44
10
0
25 Oct 2023