ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.09533
  4. Cited By
Is Independent Learning All You Need in the StarCraft Multi-Agent
  Challenge?

Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?

18 November 2020
Christian Schroeder de Witt
Tarun Gupta
Denys Makoviichuk
Viktor Makoviychuk
Philip Torr
Mingfei Sun
Shimon Whiteson
ArXivPDFHTML

Papers citing "Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?"

26 / 26 papers shown
Title
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Richard Willis
Tristan Tomilin
Joel Z Leibo
Yali Du
81
0
0
18 Mar 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
100
39
0
03 Jan 2025
Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement Learning
Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement Learning
Xingyu Wang
Jin Zhou
Yuanli Feng
Jiahao Mei
Jiming Chen
Shuo Li
70
1
0
25 Sep 2024
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains
Yinzhu Quan
Zefang Liu
LLMAG
84
4
0
16 Jul 2024
Simplifying Deep Temporal Difference Learning
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
82
21
0
05 Jul 2024
Carbon Footprint Reduction for Sustainable Data Centers in Real-Time
Carbon Footprint Reduction for Sustainable Data Centers in Real-Time
Soumyendu Sarkar
Avisek Naug
Ricardo Luna
Antonio Guillen
Vineet Gundecha
Sahand Ghorbanpour
Sajad Mousavi
Dejan Markovikj
Ashwin Ramesh Babu
AI4CE
44
8
0
21 Mar 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
90
3
0
30 Dec 2023
Revisiting Design Choices in Proximal Policy Optimization
Revisiting Design Choices in Proximal Policy Optimization
Chloe Ching-Yun Hsu
Celestine Mendler-Dünner
Moritz Hardt
101
53
0
23 Sep 2020
Implementation Matters in Deep Policy Gradients: A Case Study on PPO and
  TRPO
Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
Logan Engstrom
Andrew Ilyas
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
L. Rudolph
Aleksander Madry
AAML
46
225
0
25 May 2020
Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
81
781
0
19 Mar 2020
MAVEN: Multi-Agent Variational Exploration
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan
Tabish Rashid
Mikayel Samvelyan
Shimon Whiteson
DRL
161
358
0
16 Oct 2019
Deep Coordination Graphs
Deep Coordination Graphs
Wendelin Bohmer
Vitaly Kurin
Shimon Whiteson
GNN
46
175
0
27 Sep 2019
Exploration with Unreliable Intrinsic Reward in Multi-Agent
  Reinforcement Learning
Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning
Wendelin Bohmer
Tabish Rashid
Shimon Whiteson
23
24
0
05 Jun 2019
QTRAN: Learning to Factorize with Transformation for Cooperative
  Multi-Agent Reinforcement Learning
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son
Daewoo Kim
Wan Ju Kang
D. Hostallero
Yung Yi
OffRL
50
793
0
14 May 2019
Truly Proximal Policy Optimization
Truly Proximal Policy Optimization
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
40
123
0
19 Mar 2019
The StarCraft Multi-Agent Challenge
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
74
941
0
11 Feb 2019
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
118
1,662
0
30 Mar 2018
Guided Deep Reinforcement Learning for Swarm Systems
Guided Deep Reinforcement Learning for Swarm Systems
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann
42
130
0
18 Sep 2017
StarCraft II: A New Challenge for Reinforcement Learning
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
51
868
0
16 Aug 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
208
18,685
0
20 Jul 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
64
997
0
16 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
52
2,062
0
24 May 2017
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
161
8,805
0
04 Feb 2016
High-Dimensional Continuous Control Using Generalized Advantage
  Estimation
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
38
3,368
0
08 Jun 2015
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
237
6,722
0
19 Feb 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on
  ImageNet Classification
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
166
18,534
0
06 Feb 2015
1