Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.13590
Cited By
Suphx: Mastering Mahjong with Deep Reinforcement Learning
30 March 2020
Junjie Li
Sotetsu Koyamada
Qiwei Ye
Guoqing Liu
Chao Wang
Ruihan Yang
Li Zhao
Tao Qin
Tie-Yan Liu
H. Hon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Suphx: Mastering Mahjong with Deep Reinforcement Learning"
15 / 15 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
64
8
0
02 Aug 2024
Nash Equilibrium and Learning Dynamics in Three-Player Matching
m
m
m
-Action Games
Yuma Fujimoto
Kaito Ariu
Kenshi Abe
29
1
0
16 Feb 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
44
2
0
09 Aug 2023
Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning
Chenglu Sun
Yi-cui Zhang
Yu Zhang
Ziling Lu
Jingbin Liu
Si-Qi Xu
Weidong Zhang
27
0
0
20 Apr 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
33
13
0
01 Dec 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
19
6
0
31 Oct 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
35
0
0
24 Oct 2022
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xinyu Wang
Jing Xiao
24
0
0
11 May 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
33
26
0
30 Mar 2022
A Fast Algorithm for Computing the Deficiency Number of a Mahjong Hand
Xueqing Yan
Yongming Li
Sanjiang Li
17
0
0
15 Aug 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
25
117
0
11 Jun 2021
Universal Trading for Order Execution with Oracle Policy Distillation
Yuchen Fang
Kan Ren
Weiqing Liu
Dong Zhou
Weinan Zhang
Jiang Bian
Yong Yu
Tie-Yan Liu
OffRL
29
45
0
28 Jan 2021
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
34
55
0
15 Oct 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
42
19
0
14 Aug 2020
1