ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.16906
  4. Cited By
Debug like a Human: A Large Language Model Debugger via Verifying
  Runtime Execution Step-by-step

Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

25 February 2024
Li Zhong
Zilong Wang
Jingbo Shang
ArXivPDFHTML

Papers citing "Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step"

35 / 35 papers shown
Title
Web-Bench: A LLM Code Benchmark Based on Web Standards and Frameworks
Web-Bench: A LLM Code Benchmark Based on Web Standards and Frameworks
Kai Xu
YiWei Mao
XinYi Guan
ZiLong Feng
38
0
0
12 May 2025
Internet of Agents: Fundamentals, Applications, and Challenges
Internet of Agents: Fundamentals, Applications, and Challenges
Yuntao Wang
Shaolong Guo
Yanghe Pan
Zhou Su
Fahao Chen
Tom H. Luan
Peng Li
Jiawen Kang
Dusit Niyato
LLMAG
LM&Ro
AI4CE
60
0
0
12 May 2025
CrashFixer: A crash resolution agent for the Linux kernel
CrashFixer: A crash resolution agent for the Linux kernel
Alex Mathai
Chenxi Huang
Suwei Ma
Jihwan Kim
Hailie Mitchell
Aleksandr Nogikh
Petros Maniatis
Franjo Ivančić
Junfeng Yang
Baishakhi Ray
60
0
0
29 Apr 2025
Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs
Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs
Chen Xie
Mingsheng Jiao
Xiaodong Gu
Beijun Shen
22
0
0
21 Apr 2025
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors
Fan Nie
Lan Feng
Haotian Ye
Weixin Liang
Pan Lu
Huaxiu Yao
Alexandre Alahi
James Zou
78
0
0
07 Apr 2025
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
Runjin Chen
Zhenyu (Allen) Zhang
Junyuan Hong
Souvik Kundu
Zhangyang Wang
OffRL
LRM
47
2
0
07 Apr 2025
Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors
Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors
Zhiyu Yang
Shuo Wang
Yukun Yan
Yang Deng
29
0
0
28 Mar 2025
DatawiseAgent: A Notebook-Centric LLM Agent Framework for Automated Data Science
Ziming You
Yumiao Zhang
Dexuan Xu
Yiwei Lou
Yandong Yan
Wei Wang
H. Zhang
Yu Huang
LLMAG
62
0
0
10 Mar 2025
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding
Zhangchen Xu
Yang Liu
Yueqin Yin
Mingyuan Zhou
Radha Poovendran
ALM
OffRL
84
7
0
04 Mar 2025
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation
Humza Sami
Mubashir ul Islam
Samy Charas
Asav Gandhi
P. Gaillardon
V. Tenace
LLMAG
74
0
0
26 Feb 2025
LLM4EFFI: Leveraging Large Language Models to Enhance Code Efficiency and Correctness
LLM4EFFI: Leveraging Large Language Models to Enhance Code Efficiency and Correctness
Tong Ye
Weigang Huang
X. Zhang
Tengfei Ma
Peiyu Liu
Jianwei Yin
Wenhai Wang
LLMAG
48
2
0
17 Feb 2025
CoCoEvo: Co-Evolution of Programs and Test Cases to Enhance Code Generation
CoCoEvo: Co-Evolution of Programs and Test Cases to Enhance Code Generation
Kefan Li
Hongyue Yu
Tingyu Guo
Shijie Cao
Yuan Yuan
47
0
0
15 Feb 2025
RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation
RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation
C. Zhou
Xinyu Zhang
Dandan Song
Xiancai Chen
Wanli Gu
Huipeng Ma
Yuhang Tian
M. Zhang
Linmei Hu
63
1
0
13 Feb 2025
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging
Md. Ashraful Islam
Mohammed Eunus Ali
Md. Rizwan Parvez
LLMAG
68
2
0
08 Feb 2025
SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot
SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot
Jinlin Wu
Xusheng Liang
Xuexue Bai
Zhen Chen
74
2
0
06 Dec 2024
Inference Scaling fLaws: The Limits of LLM Resampling with Imperfect
  Verifiers
Inference Scaling fLaws: The Limits of LLM Resampling with Imperfect Verifiers
Benedikt Stroebl
Sayash Kapoor
Arvind Narayanan
LRM
85
13
0
26 Nov 2024
BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks
BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks
Shubham Gandhi
Manasi S. Patwardhan
L. Vig
Gautam M. Shroff
LLMAG
45
0
0
12 Nov 2024
Aligning CodeLLMs with Direct Preference Optimization
Aligning CodeLLMs with Direct Preference Optimization
Yibo Miao
Bofei Gao
Shanghaoran Quan
Junyang Lin
Daoguang Zan
J. Liu
Jian Yang
Tianyu Liu
Zhijie Deng
58
5
0
24 Oct 2024
Self-Explained Keywords Empower Large Language Models for Code
  Generation
Self-Explained Keywords Empower Large Language Models for Code Generation
Lishui Fan
Mouxiang Chen
Zhongxin Liu
40
1
0
21 Oct 2024
Utilizing Large Language Models in an iterative paradigm with domain
  feedback for molecule optimization
Utilizing Large Language Models in an iterative paradigm with domain feedback for molecule optimization
Khiem Le
Nitesh V. Chawla
28
0
0
17 Oct 2024
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
Jaehong Yoon
Shoubin Yu
Vaidehi Patil
Huaxiu Yao
Mohit Bansal
76
15
0
16 Oct 2024
Denial-of-Service Poisoning Attacks against Large Language Models
Denial-of-Service Poisoning Attacks against Large Language Models
Kuofeng Gao
Tianyu Pang
Chao Du
Yong Yang
Shu-Tao Xia
Min-Bin Lin
SILM
AAML
59
4
0
14 Oct 2024
Expanding Search Space with Diverse Prompting Agents: An Efficient
  Sampling Approach for LLM Mathematical Reasoning
Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning
Gisang Lee
Sangwoo Park
Junyoung Park
Andrew Chung
Sieun Park
Yoonah Park
Byungju Kim
Min-gyu Cho
LRM
27
1
0
13 Oct 2024
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement
  Learning
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Jonas Gehring
Kunhao Zheng
Jade Copet
Vegard Mella
Taco Cohen
Gabriel Synnaeve
LLMAG
32
21
0
02 Oct 2024
Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion
  in LLMs
Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs
Yohan Mathew
Ollie Matthews
Robert McCarthy
Joan Velja
Christian Schroeder de Witt
Dylan R. Cope
Nandi Schoots
24
3
0
02 Oct 2024
From Code to Correctness: Closing the Last Mile of Code Generation with
  Hierarchical Debugging
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging
Yuling Shi
Songsong Wang
Chengcheng Wan
Xiaodong Gu
ELM
29
6
0
02 Oct 2024
A Pair Programming Framework for Code Generation via Multi-Plan
  Exploration and Feedback-Driven Refinement
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement
Huan Zhang
Wei Cheng
Yuhan Wu
Wei Hu
LLMAG
31
5
0
08 Sep 2024
Multi-Programming Language Ensemble for Code Generation in Large
  Language Model
Multi-Programming Language Ensemble for Code Generation in Large Language Model
Tengfei Xue
Xuefeng Li
Tahir Azim
Roman Smirnov
Jianhui Yu
Arash Sadrieh
Babak Pahlavan
21
2
0
06 Sep 2024
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
Haolin Jin
Linghan Huang
Haipeng Cai
Jun Yan
Bo Li
Huaming Chen
78
24
0
05 Aug 2024
A Survey on Large Language Models for Code Generation
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
47
161
0
01 Jun 2024
Can Github issues be solved with Tree Of Thoughts?
Can Github issues be solved with Tree Of Thoughts?
Ricardo La Rosa
Corey Hulse
Bangdi Liu
LRM
LLMAG
41
4
0
20 May 2024
On the Limitations of Embedding Based Methods for Measuring Functional
  Correctness for Code Generation
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
40
2
0
26 Apr 2024
A Survey on Self-Evolution of Large Language Models
A Survey on Self-Evolution of Large Language Models
Zhengwei Tao
Ting-En Lin
Xiancai Chen
Hangyu Li
Yuchuan Wu
Yongbin Li
Zhi Jin
Fei Huang
Dacheng Tao
Jingren Zhou
LRM
LM&Ro
54
22
0
22 Apr 2024
Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra
  Large-Scale Code Generation and Optimization
Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization
Yoichi Ishibashi
Yoshimasa Nishimura
37
31
0
02 Apr 2024
Binding Language Models in Symbolic Languages
Binding Language Models in Symbolic Languages
Zhoujun Cheng
Tianbao Xie
Peng Shi
Chengzu Li
Rahul Nadkarni
...
Dragomir R. Radev
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
LMTD
116
197
0
06 Oct 2022
1