2210.09646
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
18 October 2022
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
Papers citing
"RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning"
44 / 44 papers shown
Title
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Julien Perolat
Bart De Vylder
Daniel Hennes
Eugene Tarassov
Florian Strub
...
Rémi Munos
David Silver
Satinder Singh
Demis Hassabis
K. Tuyls
101
203
0
30 Jun 2022
Learning Dynamics and Generalization in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
Marta Z. Kwiatkowska
Y. Gal
OOD
OffRL
69
13
0
05 Jun 2022
Off-Beat Multi-Agent Reinforcement Learning
Wei Qiu
Weixun Wang
Rongpin Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
49
2
0
27 May 2022
Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization
Zhenghao Peng
Quanyi Li
Ka-Ming Hui
Chunxiao Liu
Bolei Zhou
66
62
0
26 Oct 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
106
190
0
27 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duéñez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
93
111
0
14 Jul 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
352
118
0
13 Jul 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
165
1,278
0
02 Mar 2021
Quantifying the effects of environment and population diversity in multi-agent reinforcement learning
Kevin R. McKee
Joel Z Leibo
Charlie Beattie
Richard Everett
95
35
0
16 Feb 2021
Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments
Daochen Zha
Wenye Ma
Lei Yuan
Helen Zhou
Ji Liu
130
44
0
20 Jan 2021
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
110
457
0
03 Aug 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
93
181
0
24 Jul 2020
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Gregory Farquhar
Bei Peng
Shimon Whiteson
127
356
0
18 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
98
233
0
14 Jun 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
102
88
0
10 Jun 2020
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
172
1,838
0
13 Dec 2019
Observational Overfitting in Reinforcement Learning
Xingyou Song
Yiding Jiang
Stephen Tu
Yilun Du
Behnam Neyshabur
OffRL
118
140
0
06 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
221
1,226
0
24 Nov 2019
A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning
Nicolas Carion
Gabriel Synnaeve
A. Lazaric
Nicolas Usunier
59
29
0
19 Oct 2019
Emergent Tool Use From Multi-Agent Autocurricula
Bowen Baker
I. Kanitscheider
Todor Markov
Yi Wu
Glenn Powell
Bob McGrew
Igor Mordatch
LRM
68
658
0
17 Sep 2019
Compositionality decomposed: how do neural networks generalise?
Dieuwke Hupkes
Verna Dankers
Mathijs Mul
Elia Bruni
CoGe
162
339
0
22 Aug 2019
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
André Barreto
Diana Borsa
John Quan
Tom Schaul
David Silver
Matteo Hessel
D. Mankowitz
Augustin Žídek
Rémi Munos
OffRL
115
164
0
30 Jan 2019
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
Basel Alomair
OffRL
124
238
0
29 Oct 2018
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
119
728
0
03 Jul 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
169
1,677
0
30 Mar 2018
Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach
Weixun Wang
Jianye Hao
Yixi Wang
Matthew E. Taylor
52
32
0
01 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
198
5,226
0
26 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
249
1,609
0
05 Feb 2018
Ray: A Distributed Framework for Emerging AI Applications
Philipp Moritz
Robert Nishihara
Stephanie Wang
Alexey Tumanov
Richard Liaw
...
Melih Elibol
Zongheng Yang
William Paul
Michael I. Jordan
Ion Stoica
GNN
110
1,269
0
16 Dec 2017
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Marc Lanctot
V. Zambaldi
A. Gruslys
Angeliki Lazaridou
K. Tuyls
Julien Perolat
David Silver
T. Graepel
129
639
0
02 Nov 2017
Guided Deep Reinforcement Learning for Swarm Systems
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann
70
132
0
18 Sep 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
126
417
0
26 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
583
19,315
0
20 Jul 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
164
4,520
0
07 Jun 2017
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Joel Z Leibo
V. Zambaldi
Marc Lanctot
J. Marecki
T. Graepel
78
612
0
10 Feb 2017
The Option-Critic Architecture
Pierre-Luc Bacon
J. Harb
Doina Precup
OffRL
71
1,089
0
16 Sep 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
165
1,614
0
21 May 2016
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
Johannes Heinrich
David Silver
SSL
81
399
0
03 Mar 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
210
8,882
0
04 Feb 2016
Multiagent Cooperation and Competition with Deep Reinforcement Learning
Ardi Tampuu
Tambet Matiisen
Dorian Kodelja
Ilya Kuzovkin
Kristjan Korjus
Juhan Aru
Jaan Aru
Raul Vicente
106
868
0
27 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
332
13,295
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
137
3,442
0
08 Jun 2015
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE
AIMat
270
6,791
0
03 Sep 2014
Optimal and Approximate Q-value Functions for Decentralized POMDPs
F. Oliehoek
M. Spaan
N. Vlassis
OffRL
116
503
0
31 Oct 2011