ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.04958
  4. Cited By
Unifying Behavioral and Response Diversity for Open-ended Learning in
  Zero-sum Games

Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

9 June 2021
Xiangyu Liu
Hangtian Jia
Ying Wen
Yaodong Yang
Yujing Hu
Yingfeng Chen
Changjie Fan
Zhipeng Hu
ArXivPDFHTML

Papers citing "Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games"

11 / 11 papers shown
Title
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Jiesong Lian
Yucong Huang
Chengdong Ma
Mingzhi Wang
Ying Wen
Long Hu
Yixue Hao
65
0
0
31 May 2024
Diverse Policies Converge in Reward-free Markov Decision Processe
Diverse Policies Converge in Reward-free Markov Decision Processe
Fanqing Lin
Shiyu Huang
Weiwei Tu
30
0
0
23 Aug 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player
  Zero-Sum Games
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
39
2
0
09 Aug 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and
  Self-Play
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
26
17
0
15 Feb 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Yang Li
Shao Zhang
Jichen Sun
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
32
22
0
09 Feb 2023
Continuously Discovering Novel Strategies via Reward-Switching Policy
  Optimization
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
25
28
0
04 Apr 2022
Maximum Entropy Population-Based Training for Zero-Shot Human-AI
  Coordination
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
32
63
0
22 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence
  Model Tackles All SMAC Tasks
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
31
38
0
06 Dec 2021
A Game-Theoretic Approach for Improving Generalization Ability of TSP
  Solvers
A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Chenguang Wang
Yaodong Yang
Oliver Slumbers
Congying Han
Tiande Guo
Haifeng Zhang
Jun Wang
26
17
0
28 Oct 2021
Measuring the Non-Transitivity in Chess
Measuring the Non-Transitivity in Chess
R. Sanjaya
Jun Wang
Yaodong Yang
21
22
0
22 Oct 2021
Determinantal point processes for machine learning
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
176
1,125
0
25 Jul 2012
1