Population-based Evaluation in Repeated Rock-Paper-Scissors as a
Benchmark for Multiagent Reinforcement Learning

2 March 2023

Thomas W. Anthony

Julien Perolat

Papers citing "Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning"

Training Compute-Optimal Large Language Models | Jordan Hoffmann, Sebastian Borgeaud, A. Mensch, Elena Buchatskaya, Trevor Cai, ..., Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre | AI4TS | 189 | 1,944 | 0 | 29 Mar 2022
Approximately Solving Mean Field Games via Entropy-Regularized Deep Reinforcement Learning | Kai Cui, Heinz Koeppl | - | 100 | 94 | 0 | 02 Feb 2021
Approximate exploitability: Learning a best response in large games | Finbarr Timbers, Nolan Bard, Edward Lockhart, Marc Lanctot, Martin Schmid, Neil Burch, Julian Schrittwieser, Thomas Hubert, Michael Bowling | AAML | 31 | 27 | 0 | 20 Apr 2020
"Other-Play" for Zero-Shot Coordination | Hengyuan Hu, Adam Lerer, A. Peysakhovich, Jakob N. Foerster | VLM, OffRL | 164 | 221 | 0 | 06 Mar 2020
Dota 2 with Large Scale Deep Reinforcement Learning | OpenAI: Christopher Berner, Greg Brockman, Brooke Chan, ..., Szymon Sidor, Ilya Sutskever, Jie Tang, Filip Wolski, Susan Zhang | GNN, VLM, CLL, AI4CE, LRM | 149 | 1,822 | 0 | 13 Dec 2019
On the Utility of Learning about Humans for Human-AI Coordination | Micah Carroll, Rohin Shah, Mark K. Ho, Thomas Griffiths, Sanjit A. Seshia, Pieter Abbeel, Anca Dragan | HAI | 67 | 394 | 0 | 13 Oct 2019
OpenSpiel: A Framework for Reinforcement Learning in Games | Marc Lanctot, Edward Lockhart, Jean-Baptiste Lespiau, V. Zambaldi, Satyaki Upadhyay, ..., Julian Schrittwieser, Thomas W. Anthony, Edward Hughes, Ivo Danihelka, Jonah Ryan-Davis | OffRL | 89 | 250 | 0 | 26 Aug 2019
The Hanabi Challenge: A New Frontier for AI Research | Nolan Bard, Jakob N. Foerster, A. Chandar, Neil Burch, Marc Lanctot, ..., Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling | LLMAG | 62 | 352 | 0 | 01 Feb 2019
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures | L. Espeholt, Hubert Soyer, Rémi Munos, Karen Simonyan, Volodymyr Mnih, ..., Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu | - | 204 | 1,598 | 0 | 05 Feb 2018
Reinforcement Learning with Unsupervised Auxiliary Tasks | Max Jaderberg, Volodymyr Mnih, Wojciech M. Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, Koray Kavukcuoglu | SSL | 101 | 1,228 | 0 | 16 Nov 2016
Asynchronous Methods for Deep Reinforcement Learning | Volodymyr Mnih, Adria Puigdomenech Badia, M. Berk Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu | - | 191 | 8,850 | 0 | 04 Feb 2016
Strongly Adaptive Online Learning | Amit Daniely, Alon Gonen, Shai Shalev-Shwartz | ODL | 160 | 178 | 0 | 25 Feb 2015
The Arcade Learning Environment: An Evaluation Platform for General Agents | Marc G. Bellemare, Yavar Naddaf, J. Veness, Michael Bowling | - | 109 | 3,004 | 0 | 19 Jul 2012