arXiv: 2403.00690
Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents
1 March 2024
Dominik Jeurissen, Diego Perez-Liebana, Jeremy Gow, Duygu Cakmak, James Kwan
LLMAG
Papers citing "Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents"
7 / 7 papers shown
Codenames as a Benchmark for Large Language Models
Matthew Stephenson, Matthew Sidji, Benoît Ronval
LLMAG, LRM, ELM
111
1
0
16 Dec 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri, Bartłomiej Cupiał, Samuel Coward, Ulyana Piterbarg, Maciej Wolczyk, ..., Lerrel Pinto, Rob Fergus, Jakob Foerster, Jack Parker-Holder, Tim Rocktäschel
LLMAG, LRM
116
12
0
20 Nov 2024
GPT for Games: An Updated Scoping Review (2020-2024)
Daijin Yang, Erica Kleinman, Casper Harteveld
LLMAG, AI4TS, AI4CE
51
3
0
01 Nov 2024
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents
Junpeng Yue, Xinru Xu, Börje F. Karlsson, Zongqing Lu
39
0
0
04 Oct 2024
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
Petr Anokhin, Nikita Semenov, Artyom Sorokin, Dmitry Evseev, Andrey Kravchenko, Mikhail Burtsev, Evgeny Burnaev
LLMAG, RALM, KELM
55
7
0
05 Jul 2024
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, F. Xia, Ed H. Chi, Quoc Le, Denny Zhou
LM&Ro, LRM, AI4CE, ReLM
447
8,650
0
28 Jan 2022
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan, Robert Kirk, Vitaly Kurin, Jack Parker-Holder, Minqi Jiang, Eric Hambro, Fabio Petroni, Heinrich Küttler, Edward Grefenstette, Tim Rocktäschel
OffRL
238
89
0
27 Sep 2021