2303.03196
Cited By
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
2 March 2023
Marc Lanctot
John Schultz
Neil Burch
Max O. Smith
Daniel Hennes
Thomas W. Anthony
Julien Perolat
OffRL
Papers citing "Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning"
13 / 13 papers shown
Title
Training Compute-Optimal Large Language Models
Jordan Hoffmann
Sebastian Borgeaud
A. Mensch
Elena Buchatskaya
Trevor Cai
...
Karen Simonyan
Erich Elsen
Jack W. Rae
Oriol Vinyals
Laurent Sifre
AI4TS
189
1,944
0
29 Mar 2022
Approximately Solving Mean Field Games via Entropy-Regularized Deep Reinforcement Learning
Kai Cui
Heinz Koeppl
100
94
0
02 Feb 2021
Approximate exploitability: Learning a best response in large games
Finbarr Timbers
Nolan Bard
Edward Lockhart
Marc Lanctot
Martin Schmid
Neil Burch
Julian Schrittwieser
Thomas Hubert
Michael Bowling
AAML
31
27
0
20 Apr 2020
"Other-Play" for Zero-Shot Coordination
Hengyuan Hu
Adam Lerer
A. Peysakhovich
Jakob N. Foerster
VLM
OffRL
164
221
0
06 Mar 2020
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
149
1,822
0
13 Dec 2019
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
Sanjit A. Seshia
Pieter Abbeel
Anca Dragan
HAI
67
394
0
13 Oct 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
89
250
0
26 Aug 2019
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
62
352
0
01 Feb 2019
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
204
1,598
0
05 Feb 2018
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
101
1,228
0
16 Nov 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
191
8,850
0
04 Feb 2016
Strongly Adaptive Online Learning
Amit Daniely
Alon Gonen
Shai Shalev-Shwartz
ODL
160
178
0
25 Feb 2015
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
109
3,004
0
19 Jul 2012