Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.00841
Cited By
Offline Fictitious Self-Play for Competitive Games
29 February 2024
Jingxiao Chen
Weiji Xie
Weinan Zhang
Yong Zu
Ying Wen
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Offline Fictitious Self-Play for Competitive Games"
12 / 12 papers shown
Title
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Michaël Mathieu
Sherjil Ozair
Srivatsan Srinivasan
Çağlar Gülçehre
Shangtong Zhang
...
Sergio Gomez Colmenarejo
Aaron van den Oord
Wojciech M. Czarnecki
Nando de Freitas
Oriol Vinyals
OffRL
35
10
0
07 Aug 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
56
37
0
09 Jan 2023
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
Han Zhong
Wei Xiong
Jiyuan Tan
Liwei Wang
Tong Zhang
Zhaoran Wang
Zhuoran Yang
OffRL
65
39
0
15 Feb 2022
Cooperative Multi-Agent Deep Reinforcement Learning for Reliable Surveillance via Autonomous Multi-UAV Control
Won Joon Yun
Soohyun Park
Joongheon Kim
Myungjae Shin
Soyi Jung
David A. Mohaisen
Jae-Hyun Kim
48
134
0
15 Jan 2022
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
160
330
0
26 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
143
1,831
0
08 Jun 2020
Approximate exploitability: Learning a best response in large games
Finbarr Timbers
Nolan Bard
Edward Lockhart
Marc Lanctot
Martin Schmid
Neil Burch
Julian Schrittwieser
Thomas Hubert
Michael Bowling
AAML
54
27
0
20 Apr 2020
Google Research Football: A Novel Reinforcement Learning Environment
Karol Kurach
Anton Raichuk
Piotr Stańczyk
Michal Zajac
Olivier Bachem
...
C. Riquelme
Damien Vincent
Marcin Michalski
Olivier Bousquet
Sylvain Gelly
154
405
0
25 Jul 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
151
338
0
10 Jun 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
243
1,624
0
07 Dec 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
153
1,782
0
05 Dec 2017
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
138
617
0
08 Jun 2016
1