On the Planning Abilities of Large Language Models : A Critical Investigation

25 May 2023
Karthik Valmeekam
Matthew Marquez
S. Sreedharan
Subbarao Kambhampati
LLMAG LRM

Papers citing "On the Planning Abilities of Large Language Models : A Critical Investigation"

28 / 28 papers shown
Teaching Large Language Models to Reason through Learning and Forgetting
Tianwei Ni
Allen Nie
Sapana Chaudhary
Yao Liu
Huzefa Rangwala
Rasool Fakoor
ReLM CLL LRM
461
0
0
15 Apr 2025
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
Runjin Chen
Zhenyu Zhang
Junyuan Hong
Souvik Kundu
Zhangyang Wang
OffRL LRM
121
14
0
07 Apr 2025
Token embeddings violate the manifold hypothesis
Michael Robinson
Sourya Dey
Tony Chiang
104
2
0
01 Apr 2025
Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models
Yingshui Tan
Yilei Jiang
Yongbin Li
Qingbin Liu
Xingyuan Bu
Wenbo Su
Xiangyu Yue
Xiaoyong Zhu
Bo Zheng
ALM
135
6
0
17 Feb 2025
Episodic memory in AI agents poses risks that should be studied and mitigated
Chad DeChant
129
4
0
20 Jan 2025
Cocoa: Co-Planning and Co-Execution with AI Agents
K. J. Kevin Feng
Kevin Pu
Matt Latzke
Tal August
Pao Siangliulue
Jonathan Bragg
Daniel S. Weld
Amy X. Zhang
Joseph Chee Chang
LM&Ro LLMAG
152
7
0
14 Dec 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Jiacheng Ye
Jiahui Gao
Shansan Gong
Lin Zheng
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM LRM
146
25
0
18 Oct 2024
The Mystery of the Pathological Path-star Task for Language Models
Arvid Frydenlund
LRM
97
4
0
17 Oct 2024
Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming
Yilun Hao
Yang Zhang
Chuchu Fan
LLMAG
120
17
0
15 Oct 2024
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning
Zirui Zhao
Hanze Dong
Amrita Saha
Caiming Xiong
Doyen Sahoo
LRM
90
7
0
10 Oct 2024
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
Lingfeng Zhang
Yuening Wang
Hongjian Gu
Atia Hamidizadeh
Zhanguang Zhang
...
Tongtong Cao
Yuzheng Zhuang
Yingxue Zhang
Jianye Hao
LM&Ro
92
2
0
02 Oct 2024
CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency
Kangsheng Wang
Xiao Zhang
Zizheng Guo
Tianyu Hu
Huimin Ma
LRM
110
7
0
20 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLM LRM
207
130
0
18 Sep 2024
Bootstrapping Object-level Planning with Large Language Models
D. Paulius
Alejandro Agostini
Benedict Quartey
George Konidaris
LM&Ro
137
2
0
18 Sep 2024
Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
Wenxuan Zhang
Philip Torr
Mohamed Elhoseiny
Adel Bibi
168
15
0
27 Aug 2024
Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning
Wenjun Li
Changyu Chen
Pradeep Varakantham
180
2
0
15 Jun 2024
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming
Victor-Alexandru Pădurean
Adish Singla
ELM
94
3
0
14 Jun 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
294
54
0
23 May 2024
Chain of Thoughtlessness? An Analysis of CoT in Planning
Kaya Stechly
Karthik Valmeekam
Subbarao Kambhampati
LRM LM&Ro
147
50
0
08 May 2024
Can Large Language Models Really Improve by Self-critiquing Their Own Plans?
Karthik Valmeekam
Matthew Marquez
Subbarao Kambhampati
LRM
76
87
0
12 Oct 2023
CoPAL: Corrective Planning of Robot Actions with Large Language Models
Frank Joublin
Antonello Ceravola
Pavel Smirnov
Felix Ocker
Joerg Deigmoeller
Anna Belardinelli
Chao Wang
Stephan Hasler
Daniel Tanneberg
Michael Gienger
LM&Ro LLMAG
95
36
0
11 Oct 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG ReLM LRM
436
2,955
0
06 Oct 2022
On the Paradox of Learning to Reason from Data
Honghua Zhang
Liunian Harold Li
Tao Meng
Kai-Wei Chang
Guy Van den Broeck
NAI ReLM OOD LRM
199
107
0
23 May 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM LRM
524
6,293
0
05 Apr 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
192
1,984
0
04 Apr 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro LRM AI4CE ReLM
843
9,644
0
28 Jan 2022
Towards Understanding and Mitigating Social Biases in Language Models
Paul Pu Liang
Chiyu Wu
Louis-Philippe Morency
Ruslan Salakhutdinov
97
393
0
24 Jun 2021
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
144
1,747
0
02 Nov 2018