Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.18505
Cited By
Mental Modeling of Reinforcement Learning Agents by Language Models
26 June 2024
Wenhao Lu
Xufeng Zhao
Josua Spisak
Jae Hee Lee
Stefan Wermter
LLMAG
LRM
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mental Modeling of Reinforcement Learning Agents by Language Models"
14 / 14 papers shown
Title
Causal State Distillation for Explainable Reinforcement Learning
Wenhao Lu
Xufeng Zhao
Thilo Fryen
Jae Hee Lee
Mengdi Li
S. Magg
Stefan Wermter
CML
72
2
0
30 Dec 2023
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval
Ida Momennejad
Hosein Hasanbeig
Felipe Vieira Frujeri
Hiteshi Sharma
Robert Osazuwa Ness
Nebojsa Jojic
Hamid Palangi
Jonathan Larson
ELM
LLMAG
LRM
54
72
0
25 Sep 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
75
80
0
26 Jun 2023
In-context Reinforcement Learning with Algorithm Distillation
Michael Laskin
Luyu Wang
Junhyuk Oh
Emilio Parisotto
Stephen Spencer
...
Ethan A. Brooks
Maxime Gazeau
Himanshu Sahni
Satinder Singh
Volodymyr Mnih
OffRL
53
128
0
25 Oct 2022
Mind's Eye: Grounded Language Model Reasoning through Simulation
Ruibo Liu
Jason W. Wei
S. Gu
Te-Yen Wu
Soroush Vosoughi
Claire Cui
Denny Zhou
Andrew M. Dai
ReLM
LRM
178
81
0
11 Oct 2022
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
Pan Lu
Swaroop Mishra
Tony Xia
Liang Qiu
Kai-Wei Chang
Song-Chun Zhu
Oyvind Tafjord
Peter Clark
Ashwin Kalyan
ELM
ReLM
LRM
250
1,230
0
20 Sep 2022
What Can Transformers Learn In-Context? A Case Study of Simple Function Classes
Shivam Garg
Dimitris Tsipras
Percy Liang
Gregory Valiant
116
504
0
01 Aug 2022
Prompting Decision Transformer for Few-Shot Policy Generalization
Mengdi Xu
Songlin Yang
Shun Zhang
Yuchen Lu
Ding Zhao
J. Tenenbaum
Chuang Gan
OffRL
60
144
0
27 Jun 2022
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
93
257
0
03 Feb 2022
How Many Data Points is a Prompt Worth?
Teven Le Scao
Alexander M. Rush
VLM
143
302
0
15 Mar 2021
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
Adam Roberts
Colin Raffel
Noam M. Shazeer
KELM
104
889
0
10 Feb 2020
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
444
18,931
0
20 Jul 2017
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
156
7,623
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
310
13,214
0
09 Sep 2015
1