ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.06426
  4. Cited By
XDO: A Double Oracle Algorithm for Extensive-Form Games

XDO: A Double Oracle Algorithm for Extensive-Form Games

11 March 2021
Stephen Marcus McAleer
John Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
ArXivPDFHTML

Papers citing "XDO: A Double Oracle Algorithm for Extensive-Form Games"

10 / 10 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
51
8
0
02 Aug 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player
  Zero-Sum Games
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
39
2
0
09 Aug 2023
A Deep Reinforcement Learning Approach for Finding Non-Exploitable
  Strategies in Two-Player Atari Games
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
33
3
0
18 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
35
18
0
13 Jul 2022
Offline Equilibrium Finding
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
43
2
0
12 Jul 2022
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems
Oliver Slumbers
D. Mguni
Stephen Marcus McAleer
Stefano B. Blumberg
Jun Wang
Yaodong Yang
32
9
0
30 May 2022
Anytime PSRO for Two-Player Zero-Sum Games
Anytime PSRO for Two-Player Zero-Sum Games
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
T. Sandholm
Roy Fox
22
12
0
19 Jan 2022
Independent Natural Policy Gradient Always Converges in Markov Potential
  Games
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
26
49
0
20 Oct 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium
  Meta-Solvers
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
35
36
0
17 Jun 2021
A learning agent that acquires social norms from public sanctions in
  decentralized multi-agent settings
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
Eugene Vinitsky
Raphael Köster
J. Agapiou
Edgar A. Duénez-Guzmán
A. Vezhnevets
Joel Z. Leibo
27
37
0
16 Jun 2021
1