ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.06135
  4. Cited By
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning

DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning

11 June 2021
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
ArXivPDFHTML

Papers citing "DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning"

50 / 56 papers shown
Title
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
Zachary Roch
Chi Zhang
George Atia
Yue Wang
15
0
0
18 May 2025
PolicyEvol-Agent: Evolving Policy via Environment Perception and Self-Awareness with Theory of Mind
Yajie Yu
Yue Feng
LLMAG
33
0
0
20 Apr 2025
Imitation Learning of Correlated Policies in Stackelberg Games
Imitation Learning of Correlated Policies in Stackelberg Games
Kunag-Da Wang
Ping-Chun Hsieh
Wen-Chih Peng
48
0
0
11 Mar 2025
AI-driven control of bioelectric signalling for real-time topological reorganization of cells
AI-driven control of bioelectric signalling for real-time topological reorganization of cells
Gonçalo Hora de Carvalho
AI4CE
48
0
0
10 Mar 2025
Continuous Control of Diverse Skills in Quadruped Robots Without Complete Expert Datasets
Jiaxin Tu
Xiaoyi Wei
Yueqi Zhang
Taixian Hou
Xiaofei Gao
Zhiyan Dong
Peng Zhai
Lihua Zhang
58
0
0
05 Mar 2025
Cardiverse: Harnessing LLMs for Novel Card Game Prototyping
Cardiverse: Harnessing LLMs for Novel Card Game Prototyping
Danrui Li
Sen Zhang
Sam S. Sohn
Kaidong Hu
Muhammad Usman
Mubbasir Kapadia
40
0
0
10 Feb 2025
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL
  Evaluation and LLM Enhancement
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Junjie Lin
Jian Zhao
Lin Liu
Yue Deng
Youpeng Zhao
Lanxiao Huang
Xia Lin
Wengang Zhou
Yiming Li
74
0
0
16 Dec 2024
Improve Value Estimation of Q Function and Reshape Reward with Monte
  Carlo Tree Search
Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search
Jiamian Li
25
0
0
15 Oct 2024
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
62
8
0
02 Aug 2024
AlphaDou: High-Performance End-to-End Doudizhu AI Integrating Bidding
AlphaDou: High-Performance End-to-End Doudizhu AI Integrating Bidding
Chang Lei
Huan Lei
25
0
0
14 Jul 2024
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
Haruka Kita
Sotetsu Koyamada
Yotaro Yamaguchi
Shin Ishii
40
0
0
14 Jun 2024
Competing for pixels: a self-play algorithm for weakly-supervised
  segmentation
Competing for pixels: a self-play algorithm for weakly-supervised segmentation
Shaheer U. Saeed
Shiqi Huang
João Ramalhinho
Iani J. M. B. Gayo
Nina Montaña-Brown
...
Stephen P. Pereira
Brian R. Davidson
D. Barratt
Matthew J. Clarkson
Yipeng Hu
63
0
0
26 May 2024
DouRN: Improving DouZero by Residual Neural Networks
DouRN: Improving DouZero by Residual Neural Networks
Yiquan Chen
Yingchao Lyu
Di Zhang
16
0
0
21 Mar 2024
Agent-Pro: Learning to Evolve via Policy-Level Reflection and
  Optimization
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Wenqi Zhang
Ke Tang
Hai Wu
Mengna Wang
Yongliang Shen
Guiyang Hou
Zeqi Tan
Peng Li
Yueting Zhuang
Weiming Lu
LLMAG
44
37
0
27 Feb 2024
Mastering the Game of Guandan with Deep Reinforcement Learning and
  Behavior Regulating
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating
Yifan YangGong
Haojun Pan
Lei Wang
23
0
0
21 Feb 2024
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
41
6
0
05 Dec 2023
An Integrated Framework Integrating Monte Carlo Tree Search and
  Supervised Learning for Train Timetabling Problem
An Integrated Framework Integrating Monte Carlo Tree Search and Supervised Learning for Train Timetabling Problem
Feiyu Yang
9
2
0
02 Nov 2023
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Gregory Palmer
Chris Parry
Daniel J.B. Harrold
Chris Willis
AI4CE
21
1
0
11 Oct 2023
Quantifying Agent Interaction in Multi-agent Reinforcement Learning for
  Cost-efficient Generalization
Quantifying Agent Interaction in Multi-agent Reinforcement Learning for Cost-efficient Generalization
Yuxin Chen
Chen Tang
Ran Tian
Chenran Li
Jinning Li
Masayoshi Tomizuka
Wei Zhan
47
3
0
11 Oct 2023
Imitation Learning from Purified Demonstration
Imitation Learning from Purified Demonstration
Yunke Wang
Minjing Dong
Bo Du
Chang Xu
31
1
0
11 Oct 2023
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind
  Aware GPT-4
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4
Jiaxian Guo
Bo Yang
Paul D. Yoo
Bill Yuchen Lin
Yusuke Iwasawa
Yutaka Matsuo
LLMAG
21
41
0
29 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAI
OffRL
32
16
0
02 Sep 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player
  Zero-Sum Games
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
39
2
0
09 Aug 2023
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled
  Perturbations
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Xiangyu Liu
Benjamin Eysenbach
T. Sandholm
Furong Huang
Stephen Marcus McAleer
OOD
43
0
0
22 Jul 2023
More Like Real World Game Challenge for Partially Observable Multi-Agent
  Cooperation
More Like Real World Game Challenge for Partially Observable Multi-Agent Cooperation
Meng Yao
Xueou Feng
Qiyue Yin
24
0
0
15 May 2023
Pre-train and Search: Efficient Embedding Table Sharding with
  Pre-trained Neural Cost Models
Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Daochen Zha
Louis Feng
Liangchen Luo
Bhargav Bhushanam
Zirui Liu
...
J. McMahon
Yuzhen Huang
Bryan Clarke
A. Kejariwal
Xia Hu
58
7
0
03 May 2023
Games for Artificial Intelligence Research: A Review and Perspectives
Games for Artificial Intelligence Research: A Review and Perspectives
Chengpeng Hu
Yunlong Zhao
Ziqi Wang
Haocheng Du
Jialin Liu
AI4CE
35
13
0
26 Apr 2023
Data-centric Artificial Intelligence: A Survey
Data-centric Artificial Intelligence: A Survey
Daochen Zha
Zaid Pervaiz Bhat
Kwei-Herng Lai
Fan Yang
Zhimeng Jiang
Shaochen Zhong
Xia Hu
27
193
0
17 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
22
0
0
28 Feb 2023
Towards Personalized Preprocessing Pipeline Search
Towards Personalized Preprocessing Pipeline Search
Diego Martinez
Daochen Zha
Qiaoyu Tan
Xia Hu
AI4TS
31
2
0
28 Feb 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and
  Self-Play
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
26
17
0
15 Feb 2023
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Yunke Wang
Bo Du
Chang Xu
38
8
0
13 Feb 2023
Non-zero-sum Game Control for Multi-vehicle Driving via Reinforcement
  Learning
Non-zero-sum Game Control for Multi-vehicle Driving via Reinforcement Learning
Xujie Song
Zexi Lin
11
0
0
08 Feb 2023
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player
  Multi-Agent Learning Toolbox
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
28
13
0
01 Dec 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
19
6
0
31 Oct 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with
  Multi-Agent Reinforcement Learning
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
35
0
0
24 Oct 2022
DreamShard: Generalizable Embedding Table Placement for Recommender
  Systems
DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Daochen Zha
Louis Feng
Qiaoyu Tan
Zirui Liu
Kwei-Herng Lai
Bhargav Bhushanam
Yuandong Tian
A. Kejariwal
Xia Hu
LMTD
OffRL
33
28
0
05 Oct 2022
Towards Automated Imbalanced Learning with Deep Hierarchical
  Reinforcement Learning
Towards Automated Imbalanced Learning with Deep Hierarchical Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Qiaoyu Tan
Sirui Ding
Na Zou
Xia Hu
AI4TS
18
18
0
26 Aug 2022
AutoShard: Automated Embedding Table Sharding for Recommender Systems
AutoShard: Automated Embedding Table Sharding for Recommender Systems
Daochen Zha
Louis Feng
Bhargav Bhushanam
Dhruv Choudhary
Jade Nie
Yuandong Tian
Jay Chae
Yi-An Ma
A. Kejariwal
Xia Hu
40
30
0
12 Aug 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable
  Strategies in Two-Player Atari Games
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
33
3
0
18 Jul 2022
Towards Modern Card Games with Large-Scale Action Spaces Through Action
  Representation
Towards Modern Card Games with Large-Scale Action Spaces Through Action Representation
Zhiyuan Yao
Tianyu Shi
Site Li
Yiting Xie
Yu Qin
Xiongjie Xie
Huijuan Lu
Yan Zhang
19
2
0
25 Jun 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History
  Value Function to Estimate Regret
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Stephen Marcus McAleer
Gabriele Farina
Marc Lanctot
T. Sandholm
40
24
0
08 Jun 2022
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent
  RL
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL
Siyi Hu
Chuanlong Xie
Xiaodan Liang
Xiaojun Chang
16
20
0
01 Jun 2022
Efficient Distributed Framework for Collaborative Multi-Agent
  Reinforcement Learning
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xueliang Wang
Jing Xiao
24
0
0
11 May 2022
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided
  Learning
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning
Youpeng Zhao
Jian Zhao
Xu Hu
Wen-gang Zhou
Houqiang Li
25
15
0
06 Apr 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
33
26
0
30 Mar 2022
Concentration Network for Reinforcement Learning of Large-Scale
  Multi-Agent Systems
Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems
Qing Fu
Tenghai Qiu
Jianqiang Yi
Zhiqiang Pu
Shiguang Wu
23
16
0
12 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
38
9
0
23 Feb 2022
Variational Quantum Soft Actor-Critic
Variational Quantum Soft Actor-Critic
Qingfeng Lan
22
20
0
20 Dec 2021
On the complexity of Dark Chinese Chess
On the complexity of Dark Chinese Chess
Cong Wang
Tongwei Lu
12
0
0
06 Dec 2021
12
Next