ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.13202
  4. Cited By
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning
  Research

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

27 September 2021
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
    OffRL
ArXivPDFHTML

Papers citing "MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research"

50 / 64 papers shown
Title
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
54
0
0
23 Mar 2025
Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models
Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models
Zhiyi Huang
Xiaohan Shan
Jianmin Li
CLL
OffRL
41
0
0
17 Mar 2025
SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models
Sahar Admoni
Omer Ben-Porat
Ofra Amir
LLMAG
49
0
0
13 Mar 2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Cansu Sancaktar
Christian Gumbsch
Andrii Zadaianchuk
Pavel Kolev
Georg Martius
LM&Ro
VLM
61
1
0
03 Mar 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
38
0
0
14 Feb 2025
Environment Descriptions for Usability and Generalisation in
  Reinforcement Learning
Environment Descriptions for Usability and Generalisation in Reinforcement Learning
Dennis J. N. J. Soemers
Spyridon Samothrakis
Kurt Driessens
M. Winands
OffRL
80
1
0
22 Dec 2024
Probing for Consciousness in Machines
Probing for Consciousness in Machines
Mathis Immertreu
A. Schilling
Andreas K. Maier
P. Krauss
AI4CE
72
1
0
25 Nov 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
108
10
0
20 Nov 2024
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual
  Reinforcement Learning
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
Vindula Jayawardana
Baptiste Freydt
Ao Qu
Cameron Hickert
Zhongxia Yan
Cathy Wu
43
1
0
19 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context
  RL
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
39
6
0
09 Oct 2024
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL
Eduardo Pignatelli
Johan Ferret
Tim Rockäschel
Edward Grefenstette
Davide Paglieri
Samuel Coward
Laura Toni
38
2
0
19 Sep 2024
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen
Kenneth Marino
Rob Fergus
OCL
48
1
0
21 Aug 2024
Enhancing Agent Learning through World Dynamics Modeling
Enhancing Agent Learning through World Dynamics Modeling
Zhiyuan Sun
Haochen Shi
Marc-Alexandre Côté
Glen Berseth
Xingdi Yuan
Bang Liu
47
3
0
25 Jul 2024
Variable-Agnostic Causal Exploration for Reinforcement Learning
Variable-Agnostic Causal Exploration for Reinforcement Learning
Minh Hoang Nguyen
Hung Le
Svetha Venkatesh
CML
24
2
0
17 Jul 2024
Craftium: An Extensible Framework for Creating Reinforcement Learning
  Environments
Craftium: An Extensible Framework for Creating Reinforcement Learning Environments
Mikel Malagón
Josu Ceberio
Jose A. Lozano
38
0
0
04 Jul 2024
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating
  Automated Scientific Discovery Agents
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents
Peter Alexander Jansen
Marc-Alexandre Côté
Tushar Khot
Erin Bransom
Bhavana Dalvi Mishra
Bodhisattwa Prasad Majumder
Oyvind Tafjord
Peter Clark
LLMAG
35
21
0
10 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
32
0
0
07 Jun 2024
DreamCraft: Text-Guided Generation of Functional 3D Environments in
  Minecraft
DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft
Sam Earle
Filippos Kokkinos
Yuhe Nie
Julian Togelius
Roberta Raileanu
32
8
0
23 Apr 2024
Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents
Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents
Dominik Jeurissen
Diego Perez-Liebana
Jeremy Gow
Duygu Cakmak
James Kwan
LLMAG
24
8
0
01 Mar 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement
  Learning
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
29
24
0
26 Feb 2024
Skill Set Optimization: Reinforcing Language Model Behavior via
  Transferable Skills
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
Kolby Nottingham
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Sameer Singh
Peter Clark
Roy Fox
37
7
0
05 Feb 2024
How much can change in a year? Revisiting Evaluation in Multi-Agent
  Reinforcement Learning
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning
Siddarth S. Singh
Omayma Mahjoub
Ruan de Kock
Wiem Khlifi
Abidine Vall
Kale-ab Tessera
Arnu Pretorius
26
1
0
13 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
78
10
0
10 Dec 2023
Generalization to New Sequential Decision Making Tasks with In-Context
  Learning
Generalization to New Sequential Decision Making Tasks with In-Context Learning
Sharath Chandra Raparthy
Eric Hambro
Robert Kirk
Mikael Henaff
Roberta Raileanu
OffRL
103
21
0
06 Dec 2023
LLM Augmented Hierarchical Agents
LLM Augmented Hierarchical Agents
Bharat Prakash
Tim Oates
T. Mohsenin
11
4
0
09 Nov 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
32
3
0
27 Oct 2023
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Martin Klissarov
P. DÓro
Shagun Sodhani
Roberta Raileanu
Pierre-Luc Bacon
Pascal Vincent
Amy Zhang
Mikael Henaff
LRM
LLMAG
29
54
0
29 Sep 2023
Cyclophobic Reinforcement Learning
Cyclophobic Reinforcement Learning
Stefan Sylvius Wagner
P. Arndt
Jan Robine
Stefan Harmeling
24
1
0
30 Aug 2023
Selective Perception: Optimizing State Descriptions with Reinforcement
  Learning for Language Model Actors
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Kolby Nottingham
Yasaman Razeghi
Kyungmin Kim
JB Lanier
Pierre Baldi
Roy Fox
Sameer Singh
26
8
0
21 Jul 2023
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
Luigi Quarantiello
Simone Marzeddu
Antonio Guzzi
Vincenzo Lomonaco
27
0
0
17 Jul 2023
Comparing Reinforcement Learning and Human Learning using the Game of
  Hidden Rules
Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules
Eric Pulick
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
OffRL
9
0
0
30 Jun 2023
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Bo Liu
Yifeng Zhu
Chongkai Gao
Yihao Feng
Qian Liu
Yuke Zhu
Peter Stone
CLL
35
114
0
05 Jun 2023
A Study of Global and Episodic Bonuses for Exploration in Contextual
  MDPs
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
Mikael Henaff
Minqi Jiang
Roberta Raileanu
36
13
0
05 Jun 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
11
2
0
09 Feb 2023
Composing Task Knowledge with Modular Successor Feature Approximators
Composing Task Knowledge with Modular Successor Feature Approximators
Wilka Carvalho
Angelos Filos
Richard L. Lewis
Honglak Lee
Satinder Singh
15
7
0
28 Jan 2023
The configurable tree graph (CT-graph): measurable problems in partially
  observable and distal reward environments for lifelong reinforcement learning
The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning
Andrea Soltoggio
Eseoghene Ben-Iwhiwhu
Christos Peridis
Pawel Ladosz
Jeffery Dick
Praveen K. Pilly
Soheil Kolouri
OffRL
30
3
0
21 Jan 2023
A Domain-Agnostic Approach for Characterization of Lifelong Learning
  Systems
A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Megan M. Baker
Alexander New
Mario Aguilar-Simon
Ziad Al-Halah
Sébastien M. R. Arnold
...
Zifan Xu
A. Yanguas-Gil
Harel Yedidsion
Shangqun Yu
Gautam K. Vallabha
22
15
0
18 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
30
108
0
18 Jan 2023
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement
  Learning
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
19
83
0
14 Dec 2022
The Effectiveness of World Models for Continual Reinforcement Learning
The Effectiveness of World Models for Continual Reinforcement Learning
Samuel Kessler
M. Ostaszewski
Michal Bortkiewicz
M. Żarski
Maciej Wołczyk
Jack Parker-Holder
Stephen J. Roberts
Piotr Milo's
KELM
OffRL
CLL
25
7
0
29 Nov 2022
Powderworld: A Platform for Understanding Generalization via Rich Task
  Distributions
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans
Phillip Isola
OffRL
39
9
0
23 Nov 2022
General Intelligence Requires Rethinking Exploration
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
27
17
0
15 Nov 2022
Dungeons and Data: A Large-Scale NetHack Dataset
Dungeons and Data: A Large-Scale NetHack Dataset
Eric Hambro
Roberta Raileanu
Dan Rothermel
Vegard Mella
Tim Rocktaschel
Heinrich Küttler
Naila Murray
OffRL
126
18
0
01 Nov 2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated
  Worlds
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Joshua Albrecht
Abraham J. Fetterman
Bryden Fogelman
Ellie Kitanidis
Bartosz Wróblewski
...
Michael Rosenthal
Maksis Knutins
Zachary Polizzi
James B. Simon
Kanjun Qiu
OffRL
19
23
0
24 Oct 2022
Exploration via Elliptical Episodic Bonuses
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
29
39
0
11 Oct 2022
A Generalist Neural Algorithmic Learner
A Generalist Neural Algorithmic Learner
Borja Ibarz
Vitaly Kurin
George Papamakarios
Kyriacos Nikiforou
Mehdi Abbana Bennani
...
Andreea Deac
Beatrice Bevilacqua
Yaroslav Ganin
Charles Blundell
Petar Velivcković
OOD
24
53
0
22 Sep 2022
The Game of Hidden Rules: A New Kind of Benchmark Challenge for Machine
  Learning
The Game of Hidden Rules: A New Kind of Benchmark Challenge for Machine Learning
Eric Pulick
S. Bharti
Yiding Chen
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
8
1
0
20 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktaschel
OnRL
36
4
0
13 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
Jakob N. Foerster
41
13
0
11 Jul 2022
IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
Artem Zholus
Alexey Skrynnik
Shrestha Mohanty
Zoya Volovikova
Julia Kiseleva
Artur Szlam
Marc-Alexandre Côté
Aleksandr I. Panov
VLM
LM&Ro
AI4CE
17
13
0
31 May 2022
12
Next