Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.07700
Cited By
Anytime PSRO for Two-Player Zero-Sum Games
19 January 2022
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
Tuomas Sandholm
Roy Fox
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Anytime PSRO for Two-Player Zero-Sum Games"
12 / 12 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
81
9
0
02 Aug 2024
Iterative Empirical Game Solving via Single Policy Best Response
Max O. Smith
Thomas W. Anthony
Michael P. Wellman
52
18
0
03 Jun 2021
Evaluating Strategy Exploration in Empirical Game-Theoretic Analysis
Yongzhao Wang
Gary Qiurui Ma
Michael P. Wellman
21
11
0
21 May 2021
XDO: A Double Oracle Algorithm for Extensive-Form Games
Stephen Marcus McAleer
John Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
44
54
0
11 Mar 2021
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen Marcus McAleer
John Lanier
Roy Fox
Pierre Baldi
24
78
0
15 Jun 2020
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
75
250
0
26 Aug 2019
Stable-Predictive Optimistic Counterfactual Regret Minimization
Gabriele Farina
Christian Kroer
Noam Brown
Tuomas Sandholm
64
34
0
13 Feb 2019
Open-ended Learning in Symmetric Zero-sum Games
David Balduzzi
M. Garnelo
Yoram Bachrach
Wojciech M. Czarnecki
Julien Perolat
Max Jaderberg
T. Graepel
46
171
0
23 Jan 2019
Solving Imperfect-Information Games via Discounted Regret Minimization
Noam Brown
Tuomas Sandholm
122
167
0
11 Sep 2018
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Marc Lanctot
V. Zambaldi
A. Gruslys
Angeliki Lazaridou
K. Tuyls
Julien Perolat
David Silver
T. Graepel
91
635
0
02 Nov 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
444
18,931
0
20 Jul 2017
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
156
7,623
0
22 Sep 2015
1