ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.05128
  4. Cited By
Teaching Large Language Models to Self-Debug
v1v2 (latest)

Teaching Large Language Models to Self-Debug

11 April 2023
Xinyun Chen
Maxwell Lin
Nathanael Scharli
Denny Zhou
    LRM
ArXiv (abs)PDFHTML

Papers citing "Teaching Large Language Models to Self-Debug"

45 / 145 papers shown
Title
Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation
Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation
Nachiket Kotalwar
Alkis Gotovos
Adish Singla
ALM
118
4
0
07 Jun 2024
Learning to Edit Visual Programs with Self-Supervision
Learning to Edit Visual Programs with Self-Supervision
R. K. Jones
Renhao Zhang
Aditya Ganeshan
Daniel E. Ritchie
SSL
86
3
0
04 Jun 2024
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
Houxing Ren
Mingjie Zhan
Zhongyuan Wu
Aojun Zhou
Junting Pan
Hongsheng Li
SyDa
123
7
0
27 May 2024
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Max Liu
Chan-Hung Yu
Wei-Hsu Lee
Cheng-Wei Hung
Yen-Chun Chen
Shao-Hua Sun
143
5
0
26 May 2024
RLSF: Fine-tuning LLMs via Symbolic Feedback
RLSF: Fine-tuning LLMs via Symbolic Feedback
Piyush Jha
Prithwish Jana
Pranavkrishna Suresh
Arnav Arora
Vijay Ganesh
LRM
99
4
0
26 May 2024
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Jaekyeom Kim
Moontae Lee
Honglak Lee
Lu Wang
LRMKELMReLM
116
43
0
26 Apr 2024
NExT: Teaching Large Language Models to Reason about Code Execution
NExT: Teaching Large Language Models to Reason about Code Execution
Ansong Ni
Miltiadis Allamanis
Arman Cohan
Yinlin Deng
Kensen Shi
Charles Sutton
Pengcheng Yin
ReLMLRM
122
45
0
23 Apr 2024
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in
  Large Language Models
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models
Yanhong Li
Chenghao Yang
Allyson Ettinger
ReLMLRMLLMAG
80
11
0
14 Apr 2024
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path
  Forward
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
Xuan Xie
Jiayang Song
Zhehua Zhou
Yuheng Huang
Da Song
Lei Ma
OffRL
128
6
0
12 Apr 2024
Stable Code Technical Report
Stable Code Technical Report
Nikhil Pinnaparaju
Reshinth Adithyan
Duy Phung
J. Tow
James Baicoianu
...
Maksym Zhuravinskyi
Dakota Mahan
Marco Bellagente
Carlos Riquelme
Nathan Cooper
LRMALM
63
13
0
01 Apr 2024
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding
Ahmad A Mahmood
Ashmal Vayani
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
LRM
171
9
0
21 Mar 2024
HDLdebugger: Streamlining HDL debugging with Large Language Models
HDLdebugger: Streamlining HDL debugging with Large Language Models
Xufeng Yao
Haoyang Li
T. H. Chan
Wenyi Xiao
Mingxuan Yuan
Yu Huang
Lei Chen
Bei Yu
71
23
0
18 Mar 2024
LiveCodeBench: Holistic and Contamination Free Evaluation of Large
  Language Models for Code
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Naman Jain
King Han
Alex Gu
Wen-Ding Li
Fanjia Yan
Tianjun Zhang
Sida I. Wang
Armando Solar-Lezama
Koushik Sen
Ion Stoica
ELM
148
448
0
12 Mar 2024
Functional Benchmarks for Robust Evaluation of Reasoning Performance,
  and the Reasoning Gap
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Saurabh Srivastava
B. AnnaroseM
V. AntoP
Shashank Menon
Ajay Sukumar
T. AdwaithSamod
Alan Philipose
Stevin Prince
Sooraj Thomas
ELMReLMLRM
79
56
0
29 Feb 2024
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large
  Vision-Language Models
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Xueliang Zhao
Xinting Huang
Tingchen Fu
Qintong Li
Shansan Gong
Lemao Liu
Wei Bi
Lingpeng Kong
LRM
77
1
0
21 Feb 2024
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
Mirac Suzgun
Adam Tauman Kalai
KELMLRMLLMAGReLM
123
78
0
23 Jan 2024
JumpCoder: Go Beyond Autoregressive Coder via Online Modification
JumpCoder: Go Beyond Autoregressive Coder via Online Modification
Mouxiang Chen
Hao Tian
Zhongxi Liu
Xiaoxue Ren
Jianling Sun
SyDaKELM
95
2
0
15 Jan 2024
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Xueyu Hu
Ziyu Zhao
Shuang Wei
Ziwei Chai
Qianli Ma
...
Jiwei Li
Kun Kuang
Yang Yang
Hongxia Yang
Leilei Gan
LMTDELM
92
58
0
10 Jan 2024
DebugBench: Evaluating Debugging Capability of Large Language Models
DebugBench: Evaluating Debugging Capability of Large Language Models
Runchu Tian
Yining Ye
Yujia Qin
Xin Cong
Yankai Lin
...
Yesai Wu
Haotian Hui
Weichuan Liu
Zhiyuan Liu
Maosong Sun
ELM
122
39
0
09 Jan 2024
KEN: Kernel Extensions using Natural Language
KEN: Kernel Extensions using Natural Language
Yusheng Zheng
Yiwei Yang
Maolin Chen
Andrew Quinn
56
0
0
09 Dec 2023
Igniting Language Intelligence: The Hitchhiker's Guide From
  Chain-of-Thought Reasoning to Language Agents
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAGLM&RoLRM
149
61
0
20 Nov 2023
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining
  Language Models with First-Order Logic Provers
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers
Theo X. Olausson
Alex Gu
Benjamin Lipkin
Cedegao E. Zhang
Armando Solar-Lezama
Josh Tenenbaum
Roger Levy
LRMAI4CEReLM
168
119
0
23 Oct 2023
The Consensus Game: Language Model Generation via Equilibrium Search
The Consensus Game: Language Model Generation via Equilibrium Search
Athul Paul Jacob
Songlin Yang
Gabriele Farina
Jacob Andreas
93
23
0
13 Oct 2023
CodeChain: Towards Modular Code Generation Through Chain of
  Self-revisions with Representative Sub-modules
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules
Hung Le
Hailin Chen
Amrita Saha
Akash Gokul
Doyen Sahoo
Shafiq Joty
LRM
108
47
0
13 Oct 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
Lemur: Harmonizing Natural Language and Code for Language Agents
Yiheng Xu
Hongjin Su
Chen Xing
Boyu Mi
Qian Liu
...
Siheng Zhao
Lingpeng Kong
Bailin Wang
Caiming Xiong
Tao Yu
99
74
0
10 Oct 2023
Enabling Language Models to Implicitly Learn Self-Improvement
Enabling Language Models to Implicitly Learn Self-Improvement
Ziqi Wang
Le Hou
Tianjian Lu
Yuexin Wu
Yunxuan Li
Hongkun Yu
Heng Ji
ReLMLRM
65
6
0
02 Oct 2023
AutoAgents: A Framework for Automatic Agent Generation
AutoAgents: A Framework for Automatic Agent Generation
Guangyao Chen
Siwei Dong
Yu Shu
Ge Zhang
Jaward Sesay
Börje F. Karlsson
Jie Fu
Yemin Shi
LLMAG
119
129
0
29 Sep 2023
Hypothesis Search: Inductive Reasoning with Language Models
Hypothesis Search: Inductive Reasoning with Language Models
Ruocheng Wang
E. Zelikman
Gabriel Poesia
Yewen Pu
Nick Haber
Noah D. Goodman
ReLMLRM
132
112
0
11 Sep 2023
BatchPrompt: Accomplish more with less
BatchPrompt: Accomplish more with less
Jianzhe Lin
Maurice Diesendruck
Liang Du
Robin Abraham
LRM
96
10
0
01 Sep 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
270
31
0
27 Aug 2023
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Maciej Besta
Nils Blach
Aleš Kubíček
Robert Gerstenberger
Michal Podstawski
...
Joanna Gajda
Tomasz Lehmann
H. Niewiadomski
Piotr Nyczyk
Torsten Hoefler
LRMAI4CELM&Ro
189
716
0
18 Aug 2023
Flows: Building Blocks of Reasoning and Collaborating AI
Flows: Building Blocks of Reasoning and Collaborating AI
Martin Josifoski
Lars Klein
Maxime Peyrard
Nicolas Mario Baldwin
Yifei Li
...
Julian Paul Schnitzler
Yuxing Yao
Jiheng Wei
Debjit Paul
Robert West
AI4CE
108
25
0
02 Aug 2023
Explaining Competitive-Level Programming Solutions using LLMs
Explaining Competitive-Level Programming Solutions using LLMs
Jierui Li
Szymon Tworkowski
Yingying Wu
Raymond J. Mooney
LRM
80
17
0
11 Jul 2023
Exploring and Characterizing Large Language Models For Embedded System
  Development and Debugging
Exploring and Characterizing Large Language Models For Embedded System Development and Debugging
Zachary Englhardt
Rong-Hua Li
Dilini Nissanka
Zhihan Zhang
Girish Narayanswamy
Joseph Breda
Xin Liu
Shwetak N. Patel
Vikram Iyer
89
20
0
07 Jul 2023
Let Me Teach You: Pedagogical Foundations of Feedback for Language
  Models
Let Me Teach You: Pedagogical Foundations of Feedback for Language Models
Beatriz Borges
Niket Tandon
Tanja Käser
Antoine Bosselut
142
4
0
01 Jul 2023
System-Level Natural Language Feedback
System-Level Natural Language Feedback
Weizhe Yuan
Kyunghyun Cho
Jason Weston
115
5
0
23 Jun 2023
Deductive Verification of Chain-of-Thought Reasoning
Deductive Verification of Chain-of-Thought Reasoning
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLMLRM
109
136
0
06 Jun 2023
InstructZero: Efficient Instruction Optimization for Black-Box Large
  Language Models
InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models
Lichang Chen
Jiuhai Chen
Tom Goldstein
Heng-Chiao Huang
Dinesh Manocha
88
45
0
05 Jun 2023
ANPL: Towards Natural Programming with Interactive Decomposition
ANPL: Towards Natural Programming with Interactive Decomposition
Di Huang
Ziyuan Nan
Xingui Hu
Pengwei Jin
Shaohui Peng
...
Rui Zhang
Zidong Du
Qi Guo
Yewen Pu
Yunji Chen
87
9
0
29 May 2023
Gorilla: Large Language Model Connected with Massive APIs
Gorilla: Large Language Model Connected with Massive APIs
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
ELMCLLALMSyDa
127
572
0
24 May 2023
Generalized Planning in PDDL Domains with Pretrained Large Language
  Models
Generalized Planning in PDDL Domains with Pretrained Large Language Models
Tom Silver
Soham Dan
Kavitha Srinivas
J. Tenenbaum
L. Kaelbling
Michael Katz
ELMLRM
124
134
0
18 May 2023
Scratch Copilot Evaluation: Assessing AI-Assisted Creative Coding for
  Families
Scratch Copilot Evaluation: Assessing AI-Assisted Creative Coding for Families
Stefania Druga
Nancy Otero
AI4Ed
70
5
0
17 May 2023
LeTI: Learning to Generate from Textual Interactions
LeTI: Learning to Generate from Textual Interactions
Xingyao Wang
Hao Peng
Reyhaneh Jabbarvand
Heng Ji
116
30
0
17 May 2023
Learning to Simulate Natural Language Feedback for Interactive Semantic
  Parsing
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
Hao Yan
Saurabh Srivastava
Yintao Tai
Sida I. Wang
Wen-tau Yih
Ziyu Yao
71
19
0
14 May 2023
Hierarchical Programmatic Reinforcement Learning via Learning to Compose
  Programs
Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Guanhui. Liu
En-Pei Hu
Pu-Jen Cheng
Hung-yi Lee
Shao-Hua Sun
144
18
0
30 Jan 2023
Previous
123