2306.09884
Cited By
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX
16 June 2023
Clément Bonnet
Daniel Luo
Donal Byrne
Shikha Surana
Sasha Abramowitz
Paul Duckworth
Vincent Coyette
Laurence I. Midgley
Elshadai Tegegn
Tristan Kalloniatis
Omayma Mahjoub
Matthew Macfarlane
Andries P. Smit
Nathan Grinsztajn
Raphael Boige
Cemlyn N. Waters
Mohamed A. Mimouni
Ulrich A. Mbou Sob
Ruan de Kock
Siddarth S. Singh
Daniel Furelos-Blanco
Victor Le
Arnu Pretorius
Alexandre Laterre
Papers citing
"Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX"
24 / 24 papers shown
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL
LRM
40
0
0
08 Apr 2025
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Anja Surina
Amin Mansouri
Lars Quaedvlieg
Amal Seddas
Maryna Viazovska
Emmanuel Abbe
Çağlar Gülçehre
38
1
0
07 Apr 2025
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Tristan Tomilin
Joel Z Leibo
Yali Du
63
0
0
18 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
39
0
0
04 Mar 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
47
0
0
25 Jan 2025
Multi-Agent Environments for Vehicle Routing Problems
Ricardo Gama
Daniel Fuertes
Carlos R. del-Blanco
Hugo L. Fernandes
AI4CE
89
0
0
21 Nov 2024
Beyond the Boundaries of Proximal Policy Optimization
Charlie B. Tan
Edan Toledo
Benjamin Ellis
Jakob Foerster
Ferenc Huszár
25
0
0
01 Nov 2024
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Saman Kazemkhani
Aarav Pandya
Daphne Cornelisse
Brennan Shacklett
Eugene Vinitsky
49
9
0
02 Aug 2024
POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding
Alexey Skrynnik
Anton Andreychuk
Anatolii Borzilov
Alexander Chernyavskiy
Konstantin Yakovlev
Aleksandr I. Panov
48
1
0
20 Jul 2024
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen
Aaron Kirtland
Ruo Yu Tao
Sam Lobel
Daniel Scott
Nicholas Petrocelli
Omer Gottesman
Ronald E. Parr
M. L. Littman
George Konidaris
28
1
0
10 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
17
0
05 Jul 2024
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
51
4
0
25 Jun 2024
Discovering Minimal Reinforcement Learning Environments
Jarek Liesen
Chris Xiaoxuan Lu
Andrei Lupu
Jakob N. Foerster
Henning Sprekeler
R. T. Lange
OffRL
56
3
0
18 Jun 2024
A Batch Sequential Halving Algorithm without Performance Degradation
Sotetsu Koyamada
Soichiro Nishimori
Shin Ishii
32
0
0
01 Jun 2024
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
39
0
0
19 May 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
42
26
0
26 Feb 2024
SPO: Sequential Monte Carlo Policy Optimisation
Matthew Macfarlane
Edan Toledo
Donal Byrne
Paul Duckworth
Alexandre Laterre
32
1
0
12 Feb 2024
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
38
26
0
19 Dec 2023
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktäschel
32
8
0
21 Nov 2023
AI planning in the imagination: High-level planning on learned abstract search spaces
Carlos Martin
T. Sandholm
37
0
0
16 Aug 2023
QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration
Félix Chalumeau
Bryan Lim
Raphael Boige
Maxime Allard
Luca Grillotti
Manon Flageat
Valentin Macé
Arthur Flajolet
Thomas Pierrot
Antoine Cully
34
21
0
07 Aug 2023
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning
Sotetsu Koyamada
Shinri Okano
Soichiro Nishimori
Y. Murata
Keigo Habara
Haruka Kita
Shin Ishii
27
24
0
29 Mar 2023
Enhancing MAP-Elites with Multiple Parallel Evolution Strategies
Manon Flageat
Bryan Lim
Antoine Cully
32
2
0
10 Mar 2023
Mava: a research library for distributed multi-agent reinforcement learning in JAX
Arnu Pretorius
Kale-ab Tessera
St John Grimbly
Kevin Eloff
Lawrence Francis
Claude Formanek
Andries P. Smit
Alexandre Laterre
41
12
0
03 Jul 2021