Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.10410
Cited By
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
18 June 2020
Eric Steinberger
Adam Lerer
Noam Brown
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DREAM: Deep Regret minimization with Advantage baselines and Model-free learning"
6 / 6 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
51
8
0
02 Aug 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
39
2
0
09 Aug 2023
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
35
18
0
13 Jul 2022
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Dustin Morrill
Ryan DÓrazio
Marc Lanctot
J. R. Wright
Michael Bowling
Amy Greenwald
51
21
0
24 May 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
27
26
0
30 Mar 2022
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
18
3
0
29 Oct 2021
1