ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.01281
  4. Cited By
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"

50 / 363 papers shown
Title
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
148
30
0
10 Feb 2023
Towards Skilled Population Curriculum for Multi-Agent Reinforcement
  Learning
Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning
Rongpin Wang
Longtao Zheng
Wei Qiu
Bowei He
Bo An
Zinovi Rabinovich
Yujing Hu
Yingfeng Chen
Tangjie Lv
Changjie Fan
61
1
0
07 Feb 2023
Population-size-Aware Policy Optimization for Mean-Field Games
Population-size-Aware Policy Optimization for Mean-Field Games
Pengdeng Li
Xinrun Wang
Shuxin Li
Hau Chan
Bo An
62
2
0
07 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya Zhang
OffRL
83
20
0
03 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for
  Reinforcement Learning through Value-Decomposition
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
61
2
0
02 Feb 2023
Learning Roles with Emergent Social Value Orientations
Learning Roles with Emergent Social Value Orientations
Wenhao Li
Xiangfeng Wang
Bo Jin
J. Lu
H. Zha
47
3
0
31 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
67
0
0
19 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&RoOffRLAI4CELRM
139
119
0
18 Jan 2023
NOPA: Neurally-guided Online Probabilistic Assistance for Building
  Socially Intelligent Home Assistants
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants
Xavier Puig
Tianmin Shu
J. Tenenbaum
Antonio Torralba
58
22
0
12 Jan 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time
  Multi-Robot Cooperative Exploration
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
70
38
0
09 Jan 2023
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Sagnik Majumder
Hao Jiang
Pierre Moulon
E. Henderson
P. Calamia
Kristen Grauman
V. Ithapu
EgoV
91
7
0
04 Jan 2023
Ithaca. A Tool for Integrating Fuzzy Logic in Unity
Ithaca. A Tool for Integrating Fuzzy Logic in Unity
Alfonso Tejedor Moreno
J. A. Piedra-Fernandez
J. J. Ojeda-Castelo
L. Iribarne
35
0
0
01 Jan 2023
Towards automating Codenames spymasters with deep reinforcement learning
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
63
2
0
28 Dec 2022
Learning Latent Representations to Co-Adapt to Humans
Learning Latent Representations to Co-Adapt to Humans
Sagar Parekh
Dylan P. Losey
92
12
0
19 Dec 2022
Learning Representations that Enable Generalization in Assistive Tasks
Learning Representations that Enable Generalization in Assistive Tasks
Jerry Zhi-Yang He
Aditi Raghunathan
Daniel S. Brown
Zackory M. Erickson
Anca Dragan
OOD
86
20
0
05 Dec 2022
Launchpad: Learning to Schedule Using Offline and Online RL Methods
Launchpad: Learning to Schedule Using Offline and Online RL Methods
V. Venkataswamy
J. E. Grigsby
A. Grimshaw
Yanjun Qi
OffRLOnRL
63
1
0
01 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player
  Multi-Agent Learning Toolbox
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
76
13
0
01 Dec 2022
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
124
34
0
24 Nov 2022
Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
Shunyu Liu
Yihe Zhou
Mingli Song
Tongya Zheng
Kaixuan Chen
Tongtian Zhu
Zunlei Feng
Mingli Song
79
23
0
23 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from
  Cooperation to Team Competition
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
53
3
0
21 Nov 2022
Deep Reinforcement Learning with Vector Quantized Encoding
Deep Reinforcement Learning with Vector Quantized Encoding
Liang Zhang
Justin Lieffers
A. Pyarelal
OffRL
58
2
0
12 Nov 2022
Autotelic Reinforcement Learning in Multi-Agent Environments
Autotelic Reinforcement Learning in Multi-Agent Environments
Eleni Nisioti
E. Masquil
Gautier Hamon
Clément Moulin-Frier
67
1
0
11 Nov 2022
Job Scheduling in Datacenters using Constraint Controlled RL
Job Scheduling in Datacenters using Constraint Controlled RL
V. Venkataswamy
43
1
0
10 Nov 2022
Group Cohesion in Multi-Agent Scenarios as an Emergent Behavior
Group Cohesion in Multi-Agent Scenarios as an Emergent Behavior
Gianluca Georg Alois Volkmer
Nabil Alsabah
AI4CE
34
0
0
03 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
62
0
0
02 Nov 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
72
2
0
18 Oct 2022
Game Theoretic Rating in N-player general-sum games with Equilibria
Game Theoretic Rating in N-player general-sum games with Equilibria
Luke Marris
Marc Lanctot
I. Gemp
Shayegan Omidshafiei
Stephen Marcus McAleer
Jerome T. Connor
K. Tuyls
T. Graepel
71
3
0
05 Oct 2022
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
T. Langerak
Sammy Christen
Mert Albaba
Christoph Gebhardt
Otmar Hilliges
OffRL
62
0
0
26 Sep 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns
  for Cross-Domain Adaptation
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
84
3
0
24 Sep 2022
Quantification before Selection: Active Dynamics Preference for Robust
  Reinforcement Learning
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Kang Xu
Yan Ma
Wei Li
97
0
0
23 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary
  Reinforcement Learning towards Asynchronous Commercial Games
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
85
5
0
21 Sep 2022
ESTA: An Esports Trajectory and Action Dataset
ESTA: An Esports Trajectory and Action Dataset
Peter Xenopoulos
Claudio Silva
AI4TS
96
8
0
20 Sep 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial
  Intelligence with Humans
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELMAILaw
190
29
0
14 Sep 2022
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations
  Among Team Members
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members
Daphne Cornelisse
Thomas Rood
Mateusz Malinowski
Yoram Bachrach
Tal Kachman
58
10
0
18 Aug 2022
Reducing Exploitability with Population Based Training
Reducing Exploitability with Population Based Training
Pavel Czempin
Adam Gleave
AAML
82
6
0
10 Aug 2022
Agents Incorporating Identity and Dynamic Teams in Social Dilemmas
Agents Incorporating Identity and Dynamic Teams in Social Dilemmas
Kyle Tilbury
Jesse Hoey
AI4CE
54
0
0
05 Aug 2022
Transformers as Meta-Learners for Implicit Neural Representations
Transformers as Meta-Learners for Implicit Neural Representations
Yinbo Chen
Xiaolong Wang
AI4CE
106
68
0
04 Aug 2022
Automatic Reward Design via Learning Motivation-Consistent Intrinsic
  Rewards
Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards
Yixiang Wang
Yujing Hu
Feng Wu
Yingfeng Chen
60
2
0
29 Jul 2022
The Game of Hidden Rules: A New Kind of Benchmark Challenge for Machine
  Learning
The Game of Hidden Rules: A New Kind of Benchmark Challenge for Machine Learning
Eric Pulick
S. Bharti
Yiding Chen
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
25
1
0
20 Jul 2022
Bayesian Generational Population-Based Training
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
79
19
0
19 Jul 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
Mimetic Models: Ethical Implications of AI that Acts Like You
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
77
17
0
19 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
K-level Reasoning for Zero-Shot Coordination in Hanabi
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRLLRM
86
36
0
14 Jul 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Shunyu Liu
Mingli Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Mingli Song
58
10
0
08 Jul 2022
SC2EGSet: StarCraft II Esport Replay and Game-state Dataset
SC2EGSet: StarCraft II Esport Replay and Game-state Dataset
A. Białecki
N. Jakubowska
P. Dobrowolski
P. Białecki
Leszek Krupiñski
Andrzej Szczap
R. Białecki
Jan Gajewski
61
10
0
07 Jul 2022
"Curse of rarity" for autonomous vehicles
"Curse of rarity" for autonomous vehicles
Henry X. Liu
Shuo Feng
74
63
0
06 Jul 2022
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement
  Learning
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Julien Perolat
Bart De Vylder
Daniel Hennes
Eugene Tarassov
Florian Strub
...
Rémi Munos
David Silver
Satinder Singh
Demis Hassabis
K. Tuyls
101
206
0
30 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
162
304
0
23 Jun 2022
On the Limitations of Elo: Real-World Games, are Transitive, not
  Additive
On the Limitations of Elo: Real-World Games, are Transitive, not Additive
Quentin Bertrand
Wojciech M. Czarnecki
Gauthier Gidel
92
23
0
21 Jun 2022
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent
  Behavioral Analysis
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis
Shayegan Omidshafiei
A. Kapishnikov
Yannick Assogba
Lucas Dixon
Been Kim
OffRL
66
5
0
17 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
74
10
0
17 Jun 2022
Previous
12345678
Next