ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.07551
  4. Cited By
MALib: A Parallel Framework for Population-based Multi-agent
  Reinforcement Learning

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

5 June 2021
Ming Zhou
Bo Liu
Hanjing Wang
Muning Wen
Runzhe Wu
Ying Wen
Yaodong Yang
Weinan Zhang
Jun Wang
    OffRL
ArXivPDFHTML

Papers citing "MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning"

41 / 41 papers shown
Title
MARFT: Multi-Agent Reinforcement Fine-Tuning
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
96
3
0
21 Apr 2025
Fast Population-Based Reinforcement Learning on a Single Machine
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
67
10
0
17 Jun 2022
Modelling Behavioural Diversity for Learning in Open-Ended Games
Modelling Behavioural Diversity for Learning in Open-Ended Games
Nicolas Perez Nieves
Yaodong Yang
Oliver Slumbers
D. Mguni
Ying Wen
Jun Wang
45
71
0
14 Mar 2021
TLeague: A Framework for Competitive Self-Play based Distributed
  Multi-Agent Reinforcement Learning
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Peng Sun
Jiechao Xiong
Lei Han
Xinghai Sun
Shuxing Li
Jiawei Xu
Meng Fang
Zhengyou Zhang
OffRL
LRM
40
19
0
25 Nov 2020
PettingZoo: Gym for Multi-Agent Reinforcement Learning
PettingZoo: Gym for Multi-Agent Reinforcement Learning
J. K. Terry
Benjamin Black
Nathaniel Grammel
Mario Jayakumar
Ananth Hari
...
Caroline Horsch
Clemens Dieffendahl
Niall L. Williams
Yashas Lokesh
Praveen Ravi
OffRL
74
279
0
30 Sep 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with
  Asynchronous Reinforcement Learning
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
49
105
0
21 Jun 2020
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash
  Equilibria in Large Games
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen Marcus McAleer
John Lanier
Roy Fox
Pierre Baldi
24
78
0
15 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
108
226
0
01 Jun 2020
Fiber: A Platform for Efficient Development and Distributed Training for
  Reinforcement Learning and Population-Based Methods
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
Jiale Zhi
Rui Wang
Jeff Clune
Kenneth O. Stanley
OffRL
51
12
0
25 Mar 2020
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
140
1,820
0
13 Dec 2019
Decentralized Multi-Agent Reinforcement Learning with Networked Agents:
  Recent Advances
Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances
Kai Zhang
Zhuoran Yang
Tamer Basar
41
68
0
09 Dec 2019
PyTorch: An Imperative Style, High-Performance Deep Learning Library
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
384
42,299
0
03 Dec 2019
On the Utility of Learning about Humans for Human-AI Coordination
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
Sanjit A. Seshia
Pieter Abbeel
Anca Dragan
HAI
67
394
0
13 Oct 2019
Multi-Agent Reinforcement Learning for Order-dispatching via
  Order-Vehicle Distribution Matching
Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching
Ming Zhou
Jiarui Jin
Weinan Zhang
Zhiwei Qin
Yan Jiao
Chenxi Wang
Guobin Wu
Yong Yu
Jieping Ye
36
86
0
07 Oct 2019
A Generalized Training Approach for Multiagent Learning
A Generalized Training Approach for Multiagent Learning
Paul Muller
Shayegan Omidshafiei
Mark Rowland
K. Tuyls
Julien Perolat
...
Zhe Wang
Guy Lever
N. Heess
T. Graepel
Rémi Munos
42
91
0
27 Sep 2019
$α^α$-Rank: Practically Scaling $α$-Rank through
  Stochastic Optimisation
ααα^ααα-Rank: Practically Scaling ααα-Rank through Stochastic Optimisation
Yaodong Yang
Rasul Tutunov
Phu Sakulwongtana
Haitham Bou-Ammar
54
21
0
25 Sep 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
79
250
0
26 Aug 2019
Google Research Football: A Novel Reinforcement Learning Environment
Google Research Football: A Novel Reinforcement Learning Environment
Karol Kurach
Anton Raichuk
Piotr Stańczyk
Michal Zajac
Olivier Bachem
...
C. Riquelme
Damien Vincent
Marcin Michalski
Olivier Bousquet
Sylvain Gelly
133
402
0
25 Jul 2019
$α$-Rank: Multi-Agent Evaluation by Evolution
ααα-Rank: Multi-Agent Evaluation by Evolution
Shayegan Omidshafiei
Christos H. Papadimitriou
Georgios Piliouras
K. Tuyls
Mark Rowland
Jean-Baptiste Lespiau
Wojciech M. Czarnecki
Marc Lanctot
Julien Perolat
Rémi Munos
65
121
0
04 Mar 2019
The StarCraft Multi-Agent Challenge
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
85
950
0
11 Feb 2019
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
67
749
0
05 Oct 2018
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
101
723
0
03 Jul 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
131
1,669
0
30 Mar 2018
Accelerated Methods for Deep Reinforcement Learning
Accelerated Methods for Deep Reinforcement Learning
Adam Stooke
Pieter Abbeel
OffRL
OnRL
52
135
0
07 Mar 2018
Horovod: fast and easy distributed deep learning in TensorFlow
Horovod: fast and easy distributed deep learning in TensorFlow
Alexander Sergeev
Mike Del Balso
97
1,221
0
15 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
189
1,594
0
05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
284
8,313
0
04 Jan 2018
Ray: A Distributed Framework for Emerging AI Applications
Ray: A Distributed Framework for Emerging AI Applications
Philipp Moritz
Robert Nishihara
Stephanie Wang
Alexey Tumanov
Richard Liaw
...
Melih Elibol
Zongheng Yang
William Paul
Michael I. Jordan
Ion Stoica
GNN
89
1,256
0
16 Dec 2017
Population Based Training of Neural Networks
Population Based Training of Neural Networks
Max Jaderberg
Valentin Dalibard
Simon Osindero
Wojciech M. Czarnecki
Jeff Donahue
...
Tim Green
Iain Dunning
Karen Simonyan
Chrisantha Fernando
Koray Kavukcuoglu
69
740
0
27 Nov 2017
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Marc Lanctot
V. Zambaldi
A. Gruslys
Angeliki Lazaridou
K. Tuyls
Julien Perolat
David Silver
T. Graepel
91
635
0
02 Nov 2017
StarCraft II: A New Challenge for Reinforcement Learning
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
76
872
0
16 Aug 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
444
18,931
0
20 Jul 2017
ELF: An Extensive, Lightweight and Flexible Research Platform for
  Real-time Strategy Games
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games
Yuandong Tian
Qucheng Gong
Wenling Shang
Yuxin Wu
C. L. Zitnick
OffRL
51
126
0
04 Jul 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
136
4,468
0
07 Jun 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
92
1,536
0
10 Mar 2017
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a
  GPU
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
54
259
0
18 Nov 2016
Deep Reinforcement Learning from Self-Play in Imperfect-Information
  Games
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
Johannes Heinrich
David Silver
SSL
48
400
0
03 Mar 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
189
8,833
0
04 Feb 2016
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
308
13,214
0
09 Sep 2015
Massively Parallel Methods for Deep Reinforcement Learning
Massively Parallel Methods for Deep Reinforcement Learning
Arun Nair
Praveen Srinivasan
Sam Blackwell
Cagdas Alcicek
Rory Fearon
...
Stig Petersen
Shane Legg
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
OffRL
AI4CE
GNN
89
503
0
15 Jul 2015
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
114
12,201
0
19 Dec 2013
1