2401.05133
Neural Population Learning beyond Symmetric Zero-sum Games
10 January 2024
Siqi Liu
Luke Marris
Marc Lanctot
Georgios Piliouras
Joel Z Leibo
N. Heess
MLT
Papers citing
"Neural Population Learning beyond Symmetric Zero-sum Games"
17 / 17 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
98
9
0
02 Aug 2024
Turbocharging Solution Concepts: Solving NEs, CEs and CCEs with Neural Equilibrium Solvers
Luke Marris
I. Gemp
Thomas W. Anthony
Andrea Tacchetti
Siqi Liu
K. Tuyls
56
15
0
17 Oct 2022
Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Bobak Shahriari
A. Abdolmaleki
Arunkumar Byravan
A. Friesen
Siqi Liu
Jost Tobias Springenberg
N. Heess
Matthew W. Hoffman
Martin Riedmiller
OffRL
75
9
0
21 Apr 2022
Near-Optimal No-Regret Learning in General Games
C. Daskalakis
Maxwell Fishelson
Noah Golowich
71
106
0
16 Aug 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
73
36
0
17 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
110
131
0
25 May 2021
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
91
416
0
22 Jun 2020
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Julien Perolat
Rémi Munos
Jean-Baptiste Lespiau
Shayegan Omidshafiei
Mark Rowland
...
David Balduzzi
Bart De Vylder
Georgios Piliouras
Marc Lanctot
K. Tuyls
54
85
0
19 Feb 2020
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
99
252
0
26 Aug 2019
Correlation in Extensive-Form Games: Saddle-Point Formulation and Benchmarks
Gabriele Farina
Chun Kai Ling
Fei Fang
Tuomas Sandholm
45
42
0
29 May 2019
α-Rank: Multi-Agent Evaluation by Evolution
Shayegan Omidshafiei
Christos H. Papadimitriou
Georgios Piliouras
K. Tuyls
Mark Rowland
Jean-Baptiste Lespiau
Wojciech M. Czarnecki
Marc Lanctot
Julien Perolat
Rémi Munos
75
121
0
04 Mar 2019
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
87
150
0
19 Feb 2019
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
112
727
0
03 Jul 2018
Progress & Compress: A scalable framework for continual learning
Jonathan Richard Schwarz
Jelena Luketina
Wojciech M. Czarnecki
A. Grabska-Barwinska
Yee Whye Teh
Razvan Pascanu
R. Hadsell
CLL
125
889
0
16 May 2018
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
356
2,230
0
22 Sep 2017
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
138
617
0
08 Jun 2016
Bayes' Bluff: Opponent Modelling in Poker
F. Southey
Michael Bowling
Bryce Larson
Carmelo Piccione
Neil Burch
Darse Billings
D. C. Rayner
165
262
0
04 Jul 2012