ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.07489
  4. Cited By
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement
  Learning

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

14 December 2022
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
ArXivPDFHTML

Papers citing "SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning"

50 / 54 papers shown
Title
JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes
JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes
Shalin Jain
Jiazhen Liu
Siva Kailas
Harish Ravichandar
137
0
0
10 May 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
143
1
0
22 Jan 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
131
40
0
03 Jan 2025
Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control
Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control
Timothée Anne
Noah Syrkis
Meriem Elhosni
Florian Turati
Franck Legendre
Alain Jaquier
Sebastian Risi
LLMAG
134
2
0
16 Dec 2024
Simplifying Deep Temporal Difference Learning
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
113
25
0
05 Jul 2024
Variational Offline Multi-agent Skill Discovery
Variational Offline Multi-agent Skill Discovery
Jiayu Chen
Bhargav Ganguly
Tian-Shing Lan
OffRL
93
3
0
26 May 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
112
3
0
30 Dec 2023
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Jie Song
Mingli Song
56
19
0
27 May 2023
Towards a Standardised Performance Evaluation Protocol for Cooperative
  MARL
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
73
50
0
21 Sep 2022
Learning Progress Driven Multi-Agent Curriculum
Learning Progress Driven Multi-Agent Curriculum
Wenshuai Zhao
Zhiyuan Li
Joni Pajarinen
82
0
0
20 May 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
199
817
0
12 May 2022
Trust Region Bounds for Decentralized PPO Under Non-stationarity
Trust Region Bounds for Decentralized PPO Under Non-stationarity
Mingfei Sun
Sam Devlin
Jacob Beck
Katja Hofmann
Shimon Whiteson
106
10
0
31 Jan 2022
You May Not Need Ratio Clipping in PPO
You May Not Need Ratio Clipping in PPO
Mingfei Sun
Vitaly Kurin
Guoqing Liu
Sam Devlin
Tao Qin
Katja Hofmann
Shimon Whiteson
38
16
0
31 Jan 2022
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning
  Research
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
289
90
0
27 Sep 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting
  Pot
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
86
109
0
14 Jul 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
131
680
0
03 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
127
1,640
0
02 Jun 2021
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Anuj Mahajan
Mikayel Samvelyan
Lei Mao
Viktor Makoviychuk
Animesh Garg
Jean Kossaifi
Shimon Whiteson
Yuke Zhu
Anima Anandkumar
71
32
0
31 May 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
134
1,250
0
02 Mar 2021
Rethinking the Implementation Tricks and Monotonicity Constraint in
  Cooperative Multi-Agent Reinforcement Learning
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Siyang Jiang
Seth Austin Harding
Haibin Wu
Shihua Liao
54
89
0
06 Feb 2021
Is Independent Learning All You Need in the StarCraft Multi-Agent
  Challenge?
Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Christian Schroeder de Witt
Tarun Gupta
Denys Makoviichuk
Viktor Makoviychuk
Philip Torr
Mingfei Sun
Shimon Whiteson
67
333
0
18 Nov 2020
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement
  Learning
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Tarun Gupta
Anuj Mahajan
Bei Peng
Wendelin Bohmer
Shimon Whiteson
OffRL
47
50
0
06 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
72
209
0
04 Oct 2020
PettingZoo: Gym for Multi-Agent Reinforcement Learning
PettingZoo: Gym for Multi-Agent Reinforcement Learning
J. K. Terry
Benjamin Black
Nathaniel Grammel
Mario Jayakumar
Ananth Hari
...
Caroline Horsch
Clemens Dieffendahl
Niall L. Williams
Yashas Lokesh
Praveen Ravi
OffRL
79
281
0
30 Sep 2020
QPLEX: Duplex Dueling Multi-Agent Q-Learning
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
99
455
0
03 Aug 2020
The NetHack Learning Environment
The NetHack Learning Environment
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktaschel
67
181
0
24 Jun 2020
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep
  Multi-Agent Reinforcement Learning
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Gregory Farquhar
Bei Peng
Shimon Whiteson
99
353
0
18 Jun 2020
Randomized Entity-wise Factorization for Multi-Agent Reinforcement
  Learning
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
Shariq Iqbal
Christian Schroeder de Witt
Bei Peng
Wendelin Bohmer
Shimon Whiteson
Fei Sha
54
65
0
07 Jun 2020
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles
Tonghan Wang
Heng Dong
V. Lesser
Chongjie Zhang
89
218
0
18 Mar 2020
FACMAC: Factored Multi-Agent Centralised Policy Gradients
FACMAC: Factored Multi-Agent Centralised Policy Gradients
Bei Peng
Tabish Rashid
Christian Schroeder de Witt
Pierre-Alexandre Kamienny
Philip Torr
Wendelin Bohmer
Shimon Whiteson
51
260
0
14 Mar 2020
Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map
  Them to Actions
Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions
J. Schmidhuber
53
131
0
05 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
72
555
0
03 Dec 2019
Increasing Generality in Machine Learning through Procedural Content
  Generation
Increasing Generality in Machine Learning through Procedural Content Generation
S. Risi
Julian Togelius
64
125
0
29 Nov 2019
MAVEN: Multi-Agent Variational Exploration
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan
Tabish Rashid
Mikayel Samvelyan
Shimon Whiteson
DRL
184
361
0
16 Oct 2019
On the Utility of Learning about Humans for Human-AI Coordination
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
Sanjit A. Seshia
Pieter Abbeel
Anca Dragan
HAI
69
394
0
13 Oct 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
97
250
0
26 Aug 2019
Google Research Football: A Novel Reinforcement Learning Environment
Google Research Football: A Novel Reinforcement Learning Environment
Karol Kurach
Anton Raichuk
Piotr Stańczyk
Michal Zajac
Olivier Bachem
...
C. Riquelme
Damien Vincent
Marcin Michalski
Olivier Bousquet
Sylvain Gelly
135
403
0
25 Jul 2019
QTRAN: Learning to Factorize with Transformation for Cooperative
  Multi-Agent Reinforcement Learning
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son
Daewoo Kim
Wan Ju Kang
D. Hostallero
Yung Yi
OffRL
54
807
0
14 May 2019
Neural MMO: A Massively Multiagent Game Environment for Training and
  Evaluating Intelligent Agents
Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents
Joseph Suárez
Yilun Du
Phillip Isola
Igor Mordatch
59
71
0
02 Mar 2019
The StarCraft Multi-Agent Challenge
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
93
953
0
11 Feb 2019
Obstacle Tower: A Generalization Challenge in Vision, Control, and
  Planning
Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning
Arthur Juliani
Ahmed Khalifa
Vincent-Pierre Berges
Jonathan Harper
Ervin Teng
Hunter Henry
A. Crespi
Julian Togelius
Danny Lange
52
143
0
04 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
64
352
0
01 Feb 2019
Pommerman: A Multi-Agent Playground
Pommerman: A Multi-Agent Playground
Cinjon Resnick
W. Eldridge
David R Ha
D. Britz
Jakob N. Foerster
Julian Togelius
Kyunghyun Cho
Joan Bruna
LLMAG
55
85
0
19 Sep 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
150
1,671
0
30 Mar 2018
Mean Field Multi-Agent Reinforcement Learning
Mean Field Multi-Agent Reinforcement Learning
Yaodong Yang
Rui Luo
Minne Li
M. Zhou
Weinan Zhang
Jun Wang
AI4CE
61
574
0
15 Feb 2018
StarCraft II: A New Challenge for Reinforcement Learning
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
76
874
0
16 Aug 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
69
1,006
0
16 Jun 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
698
131,526
0
12 Jun 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
140
4,482
0
07 Jun 2017
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Joel Z Leibo
V. Zambaldi
Marc Lanctot
J. Marecki
T. Graepel
73
608
0
10 Feb 2017
12
Next