Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1807.01281
Cited By
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"
50 / 363 papers shown
Title
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
148
30
0
10 Feb 2023
Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning
Rongpin Wang
Longtao Zheng
Wei Qiu
Bowei He
Bo An
Zinovi Rabinovich
Yujing Hu
Yingfeng Chen
Tangjie Lv
Changjie Fan
61
1
0
07 Feb 2023
Population-size-Aware Policy Optimization for Mean-Field Games
Pengdeng Li
Xinrun Wang
Shuxin Li
Hau Chan
Bo An
62
2
0
07 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya Zhang
OffRL
83
20
0
03 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
61
2
0
02 Feb 2023
Learning Roles with Emergent Social Value Orientations
Wenhao Li
Xiangfeng Wang
Bo Jin
J. Lu
H. Zha
47
3
0
31 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
67
0
0
19 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
139
119
0
18 Jan 2023
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants
Xavier Puig
Tianmin Shu
J. Tenenbaum
Antonio Torralba
58
22
0
12 Jan 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
70
38
0
09 Jan 2023
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Sagnik Majumder
Hao Jiang
Pierre Moulon
E. Henderson
P. Calamia
Kristen Grauman
V. Ithapu
EgoV
91
7
0
04 Jan 2023
Ithaca. A Tool for Integrating Fuzzy Logic in Unity
Alfonso Tejedor Moreno
J. A. Piedra-Fernandez
J. J. Ojeda-Castelo
L. Iribarne
35
0
0
01 Jan 2023
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
63
2
0
28 Dec 2022
Learning Latent Representations to Co-Adapt to Humans
Sagar Parekh
Dylan P. Losey
92
12
0
19 Dec 2022
Learning Representations that Enable Generalization in Assistive Tasks
Jerry Zhi-Yang He
Aditi Raghunathan
Daniel S. Brown
Zackory M. Erickson
Anca Dragan
OOD
86
20
0
05 Dec 2022
Launchpad: Learning to Schedule Using Offline and Online RL Methods
V. Venkataswamy
J. E. Grigsby
A. Grimshaw
Yanjun Qi
OffRL
OnRL
63
1
0
01 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
76
13
0
01 Dec 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
124
34
0
24 Nov 2022
Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
Shunyu Liu
Yihe Zhou
Mingli Song
Tongya Zheng
Kaixuan Chen
Tongtian Zhu
Zunlei Feng
Mingli Song
79
23
0
23 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
53
3
0
21 Nov 2022
Deep Reinforcement Learning with Vector Quantized Encoding
Liang Zhang
Justin Lieffers
A. Pyarelal
OffRL
58
2
0
12 Nov 2022
Autotelic Reinforcement Learning in Multi-Agent Environments
Eleni Nisioti
E. Masquil
Gautier Hamon
Clément Moulin-Frier
67
1
0
11 Nov 2022
Job Scheduling in Datacenters using Constraint Controlled RL
V. Venkataswamy
43
1
0
10 Nov 2022
Group Cohesion in Multi-Agent Scenarios as an Emergent Behavior
Gianluca Georg Alois Volkmer
Nabil Alsabah
AI4CE
34
0
0
03 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
62
0
0
02 Nov 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
72
2
0
18 Oct 2022
Game Theoretic Rating in N-player general-sum games with Equilibria
Luke Marris
Marc Lanctot
I. Gemp
Shayegan Omidshafiei
Stephen Marcus McAleer
Jerome T. Connor
K. Tuyls
T. Graepel
71
3
0
05 Oct 2022
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
T. Langerak
Sammy Christen
Mert Albaba
Christoph Gebhardt
Otmar Hilliges
OffRL
62
0
0
26 Sep 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
84
3
0
24 Sep 2022
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Kang Xu
Yan Ma
Wei Li
97
0
0
23 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
85
5
0
21 Sep 2022
ESTA: An Esports Trajectory and Action Dataset
Peter Xenopoulos
Claudio Silva
AI4TS
96
8
0
20 Sep 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELM
AILaw
190
29
0
14 Sep 2022
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members
Daphne Cornelisse
Thomas Rood
Mateusz Malinowski
Yoram Bachrach
Tal Kachman
58
10
0
18 Aug 2022
Reducing Exploitability with Population Based Training
Pavel Czempin
Adam Gleave
AAML
82
6
0
10 Aug 2022
Agents Incorporating Identity and Dynamic Teams in Social Dilemmas
Kyle Tilbury
Jesse Hoey
AI4CE
54
0
0
05 Aug 2022
Transformers as Meta-Learners for Implicit Neural Representations
Yinbo Chen
Xiaolong Wang
AI4CE
106
68
0
04 Aug 2022
Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards
Yixiang Wang
Yujing Hu
Feng Wu
Yingfeng Chen
60
2
0
29 Jul 2022
The Game of Hidden Rules: A New Kind of Benchmark Challenge for Machine Learning
Eric Pulick
S. Bharti
Yiding Chen
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
25
1
0
20 Jul 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
79
19
0
19 Jul 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
77
17
0
19 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRL
LRM
86
36
0
14 Jul 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Shunyu Liu
Mingli Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Mingli Song
58
10
0
08 Jul 2022
SC2EGSet: StarCraft II Esport Replay and Game-state Dataset
A. Białecki
N. Jakubowska
P. Dobrowolski
P. Białecki
Leszek Krupiñski
Andrzej Szczap
R. Białecki
Jan Gajewski
61
10
0
07 Jul 2022
"Curse of rarity" for autonomous vehicles
Henry X. Liu
Shuo Feng
74
63
0
06 Jul 2022
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Julien Perolat
Bart De Vylder
Daniel Hennes
Eugene Tarassov
Florian Strub
...
Rémi Munos
David Silver
Satinder Singh
Demis Hassabis
K. Tuyls
101
206
0
30 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
162
304
0
23 Jun 2022
On the Limitations of Elo: Real-World Games, are Transitive, not Additive
Quentin Bertrand
Wojciech M. Czarnecki
Gauthier Gidel
92
23
0
21 Jun 2022
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis
Shayegan Omidshafiei
A. Kapishnikov
Yannick Assogba
Lucas Dixon
Been Kim
OffRL
66
5
0
17 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
74
10
0
17 Jun 2022
Previous
1
2
3
4
5
6
7
8
Next