ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.01955
  4. Cited By
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

2 March 2021
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
    OffRL
ArXivPDFHTML

Papers citing "The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games"

34 / 184 papers shown
Title
Distributed Influence-Augmented Local Simulators for Parallel MARL in
  Large Networked Systems
Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Miguel Suau
Jinke He
Mustafa Mert cCelikok
M. Spaan
F. Oliehoek
21
1
0
01 Jul 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
31
22
0
24 Jun 2022
Nocturne: a scalable driving benchmark for bringing multi-agent learning
  one step closer to the real world
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world
Eugene Vinitsky
Nathan Lichtlé
Xiaomeng Yang
Brandon Amos
Jakob N. Foerster
OffRL
43
52
0
20 Jun 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement
  Learning
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
51
109
0
17 Jun 2022
Universally Expressive Communication in Multi-Agent Reinforcement
  Learning
Universally Expressive Communication in Multi-Agent Reinforcement Learning
Matthew Morris
Thomas D. Barrett
Arnu Pretorius
24
4
0
14 Jun 2022
Policy Optimization for Markov Games: Unified Framework and Faster
  Convergence
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
29
26
0
06 Jun 2022
Learning Generalized Wireless MAC Communication Protocols via
  Abstraction
Learning Generalized Wireless MAC Communication Protocols via Abstraction
Luciano Miuccio
Salvatore Riolo
S. Samarakoon
D. Panno
M. Bennis
24
17
0
06 Jun 2022
Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games
Ziyi Liu
Xian Guo
Yongchun Fang
18
0
0
31 May 2022
MAVIPER: Learning Decision Tree Policies for Interpretable Multi-Agent
  Reinforcement Learning
MAVIPER: Learning Decision Tree Policies for Interpretable Multi-Agent Reinforcement Learning
Stephanie Milani
Zhicheng Zhang
Nicholay Topin
Z. Shi
Charles A. Kamhoua
Evangelos E. Papalexakis
Fei Fang
OffRL
83
13
0
25 May 2022
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
Linrui Zhang
Li Shen
Long Yang
Shi-Yong Chen
Bo Yuan
Xueqian Wang
Dacheng Tao
13
62
0
24 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
41
8
0
20 May 2022
Learning Progress Driven Multi-Agent Curriculum
Learning Progress Driven Multi-Agent Curriculum
Wenshuai Zhao
Zhiyuan Li
Joni Pajarinen
40
0
0
20 May 2022
RoMFAC: A robust mean-field actor-critic reinforcement learning against
  adversarial perturbations on states
RoMFAC: A robust mean-field actor-critic reinforcement learning against adversarial perturbations on states
Ziyuan Zhou
Guanjun Liu
AAML
35
24
0
15 May 2022
Collaborative Target Search with a Visual Drone Swarm: An Adaptive
  Curriculum Embedded Multistage Reinforcement Learning Approach
Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach
Jiaping Xiao
Phumrapee Pisutsin
Mir Feroskhan
27
16
0
26 Apr 2022
Towards Comprehensive Testing on the Robustness of Cooperative
  Multi-agent Reinforcement Learning
Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning
Jun Guo
Yonghong Chen
Yihang Hao
Zixin Yin
Yin Yu
Simin Li
AAML
32
32
0
17 Apr 2022
Asynchronous, Option-Based Multi-Agent Policy Gradient: A Conditional
  Reasoning Approach
Asynchronous, Option-Based Multi-Agent Policy Gradient: A Conditional Reasoning Approach
Xubo Lyu
Amin Banitalebi-Dehkordi
Mo Chen
Yong Zhang
32
2
0
29 Mar 2022
An Introduction to Multi-Agent Reinforcement Learning and Review of its
  Application to Autonomous Mobility
An Introduction to Multi-Agent Reinforcement Learning and Review of its Application to Autonomous Mobility
Lukas M. Schmidt
Johanna Brosig
Axel Plinge
Bjoern M. Eskofier
Christopher Mutschler
38
33
0
15 Mar 2022
Multi-Agent Reinforcement Learning for Network Load Balancing in Data
  Center
Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center
Zhiyuan Yao
Zihan Ding
T. Clausen
34
7
0
27 Jan 2022
GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement
  Learning
GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning
Jingqing Ruan
Yali Du
Xuantang Xiong
Dengpeng Xing
Xiyun Li
Linghui Meng
Haifeng Zhang
Jun Wang
Bo Xu
46
29
0
17 Jan 2022
CGIBNet: Bandwidth-constrained Communication with Graph Information
  Bottleneck in Multi-Agent Reinforcement Learning
CGIBNet: Bandwidth-constrained Communication with Graph Information Bottleneck in Multi-Agent Reinforcement Learning
Qi Tian
Kun Kuang
Baoxiang Wang
Furui Liu
Fei Wu
26
0
0
20 Dec 2021
Multi-agent Soft Actor-Critic Based Hybrid Motion Planner for Mobile
  Robots
Multi-agent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots
Zicheng He
Lu Dong
Chunwei Song
Changyin Sun
35
25
0
13 Dec 2021
Cooperative Multi-Agent Reinforcement Learning with Hypergraph
  Convolution
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
Yunru Bai
Chen Gong
Bin Zhang
Guoliang Fan
Xinwen Hou
Yu Liu
27
6
0
09 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence
  Model Tackles All SMAC Tasks
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
28
38
0
06 Dec 2021
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent
  Learning
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
D. Mguni
Taher Jafferjee
Jianhong Wang
Oliver Slumbers
Nicolas Perez Nieves
Feifei Tong
Yang Li
Jiangcheng Zhu
Yaodong Yang
Jun Wang
45
18
0
05 Dec 2021
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement
  Learning
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning
Andrew Cohen
Ervin Teng
Vincent-Pierre Berges
Ruo-Ping Dong
Hunter Henry
Marwan Mattar
Alexander Zook
Sujoy Ganguly
24
33
0
10 Nov 2021
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative
  Multi-Agent Problems
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems
Jiayu Chen
Yuanxin Zhang
Yuanfan Xu
Huimin Ma
Huazhong Yang
Jiaming Song
Yu Wang
Yi Wu
VLM
DRL
26
32
0
08 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with
  Theory of Mind
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
19
67
0
15 Oct 2021
Divergence-Regularized Multi-Agent Actor-Critic
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
46
25
0
01 Oct 2021
MACRPO: Multi-Agent Cooperative Recurrent Policy Optimization
MACRPO: Multi-Agent Cooperative Recurrent Policy Optimization
E. Kargar
Ville Kyrki
24
2
0
02 Sep 2021
A review of mobile robot motion planning methods: from classical motion
  planning workflows to reinforcement learning-based architectures
A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
Changyin Sun
Zicheng He
Chunwei Song
Changyin Sun
38
54
0
31 Aug 2021
Settling the Variance of Multi-Agent Policy Gradients
Settling the Variance of Multi-Agent Policy Gradients
J. Kuba
Muning Wen
Yaodong Yang
Linghui Meng
Shangding Gu
Haifeng Zhang
D. Mguni
Jun Wang
24
58
0
19 Aug 2021
SA-MATD3:Self-attention-based multi-agent continuous control method in
  cooperative environments
SA-MATD3:Self-attention-based multi-agent continuous control method in cooperative environments
Kai Liu
Yuyang Zhao
Gang Wang
Bei Peng
25
18
0
01 Jul 2021
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces
Chi Jin
Qinghua Liu
Tiancheng Yu
26
50
0
07 Jun 2021
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
26
220
0
14 Jun 2020
Previous
1234