2310.08118
Cited By
Can Large Language Models Really Improve by Self-critiquing Their Own Plans?
12 October 2023
Karthik Valmeekam
Matthew Marquez
Subbarao Kambhampati
LRM
Papers citing
"Can Large Language Models Really Improve by Self-critiquing Their Own Plans?"
11 / 11 papers shown
Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators
Yilun Zhou
Austin Xu
Peifeng Wang
Caiming Xiong
Shafiq Joty
ELM
ALM
LRM
142
5
0
21 Apr 2025
Stay Focused: Problem Drift in Multi-Agent Debate
Jonas Becker
Lars Benedikt Kaesberg
Andreas Stephan
Jan Philip Wahle
Terry Ruas
Bela Gipp
107
2
0
26 Feb 2025
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
Yi Fang
Moxin Li
Wenjie Wang
Hui Lin
Fuli Feng
LRM
93
8
0
17 Jun 2024
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
Yuqi Zhu
Shuofei Qiao
Yixin Ou
Shumin Deng
N. Zhang
Shiwei Lyu
Yue Shen
Lei Liang
Jinjie Gu
Ningyu Zhang
LLMAG
LM&Ro
142
32
0
05 Mar 2024
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jiang
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
140
384
0
29 May 2023
On the Planning Abilities of Large Language Models: A Critical Investigation
Karthik Valmeekam
Matthew Marquez
S. Sreedharan
Subbarao Kambhampati
LLMAG
LRM
50
237
0
25 May 2023
Improving Factuality and Reasoning in Language Models through Multiagent Debate
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
LLMAG
LRM
157
741
0
23 May 2023
LM vs LM: Detecting Factual Errors via Cross Examination
Roi Cohen
May Hamri
Mor Geva
Amir Globerson
HILM
106
141
0
22 May 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Shunyu Yao
Dian Yu
Jeffrey Zhao
Izhak Shafran
Thomas Griffiths
Yuan Cao
Karthik Narasimhan
LM&Ro
LRM
AI4CE
156
2,025
0
17 May 2023
Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
...
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
167
1,670
0
30 Mar 2023
Language Models can Solve Computer Tasks
Geunwoo Kim
Pierre Baldi
Stephen Marcus McAleer
LLMAG
LM&Ro
136
372
0
30 Mar 2023