arXiv:2011.09533
Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
18 November 2020
Christian Schroeder de Witt
Tarun Gupta
Denys Makoviichuk
Viktor Makoviychuk
Philip Torr
Mingfei Sun
Shimon Whiteson
Papers citing "Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?"
26 / 26 papers shown
Title
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Tristan Tomilin
Joel Z Leibo
Yali Du
81
0
0
18 Mar 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
100
39
0
03 Jan 2025
Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement Learning
Xingyu Wang
Jin Zhou
Yuanli Feng
Jiahao Mei
Jiming Chen
Shuo Li
70
1
0
25 Sep 2024
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains
Yinzhu Quan
Zefang Liu
LLMAG
84
4
0
16 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
82
21
0
05 Jul 2024
Carbon Footprint Reduction for Sustainable Data Centers in Real-Time
Soumyendu Sarkar
Avisek Naug
Ricardo Luna
Antonio Guillen
Vineet Gundecha
Sahand Ghorbanpour
Sajad Mousavi
Dejan Markovikj
Ashwin Ramesh Babu
AI4CE
44
8
0
21 Mar 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
90
3
0
30 Dec 2023
Revisiting Design Choices in Proximal Policy Optimization
Chloe Ching-Yun Hsu
Celestine Mendler-Dünner
Moritz Hardt
101
53
0
23 Sep 2020
Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
Logan Engstrom
Andrew Ilyas
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
L. Rudolph
Aleksander Madry
AAML
46
225
0
25 May 2020
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
81
781
0
19 Mar 2020
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan
Tabish Rashid
Mikayel Samvelyan
Shimon Whiteson
DRL
161
358
0
16 Oct 2019
Deep Coordination Graphs
Wendelin Böhmer
Vitaly Kurin
Shimon Whiteson
GNN
46
175
0
27 Sep 2019
Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning
Wendelin Böhmer
Tabish Rashid
Shimon Whiteson
23
24
0
05 Jun 2019
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son
Daewoo Kim
Wan Ju Kang
D. Hostallero
Yung Yi
OffRL
50
793
0
14 May 2019
Truly Proximal Policy Optimization
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
40
123
0
19 Mar 2019
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
74
941
0
11 Feb 2019
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
118
1,662
0
30 Mar 2018
Guided Deep Reinforcement Learning for Swarm Systems
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann
42
130
0
18 Sep 2017
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
51
868
0
16 Aug 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
208
18,685
0
20 Jul 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
64
997
0
16 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
52
2,062
0
24 May 2017
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
161
8,805
0
04 Feb 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
38
3,368
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
237
6,722
0
19 Feb 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
VLM
166
18,534
0
06 Feb 2015