ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.01281
  4. Cited By
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"

50 / 363 papers shown
Title
Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games
Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
102
12
0
08 Jun 2022
From Attribution Maps to Human-Understandable Explanations through
  Concept Relevance Propagation
From Attribution Maps to Human-Understandable Explanations through Concept Relevance Propagation
Reduan Achtibat
Maximilian Dreyer
Ilona Eisenbraun
S. Bosse
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin
FAtt
87
150
0
07 Jun 2022
Double Deep Q Networks for Sensor Management in Space Situational
  Awareness
Double Deep Q Networks for Sensor Management in Space Situational Awareness
Ben D. Oakes
Dominic Richards
Jordi Barr
Jason Ralph
31
8
0
27 May 2022
Exploring the Benefits of Teams in Multiagent Learning
Exploring the Benefits of Teams in Multiagent Learning
David Radke
Kate Larson
Timothy B. Brecht
AI4TS
83
10
0
04 May 2022
The Importance of Credo in Multiagent Learning
The Importance of Credo in Multiagent Learning
David Radke
Kate Larson
Timothy B. Brecht
97
12
0
15 Apr 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy
  Optimization
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
85
30
0
04 Apr 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Jaan Aru
Aqeel Labash
Oriol Corcoll
Raul Vicente
95
27
0
30 Mar 2022
Self-Imitation Learning from Demonstrations
Self-Imitation Learning from Demonstrations
Georgiy Pshikhachev
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
56
6
0
21 Mar 2022
Safe adaptation in multiagent competition
Safe adaptation in multiagent competition
Macheng Shen
Jonathan P. How
TTA
65
1
0
14 Mar 2022
On Credit Assignment in Hierarchical Reinforcement Learning
On Credit Assignment in Hierarchical Reinforcement Learning
Joery A. de Vries
Thomas M. Moerland
Aske Plaat
25
0
0
07 Mar 2022
AutoDIME: Automatic Design of Interesting Multi-Agent Environments
AutoDIME: Automatic Design of Interesting Multi-Agent Environments
I. Kanitscheider
Harrison Edwards
58
0
0
04 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
125
11
0
01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
71
9
0
23 Feb 2022
Behaviour-Diverse Automatic Penetration Testing: A Curiosity-Driven
  Multi-Objective Deep Reinforcement Learning Approach
Behaviour-Diverse Automatic Penetration Testing: A Curiosity-Driven Multi-Objective Deep Reinforcement Learning Approach
Yizhou Yang
Xin Liu
AAML
65
15
0
22 Feb 2022
Investigations of Performance and Bias in Human-AI Teamwork in Hiring
Investigations of Performance and Bias in Human-AI Teamwork in Hiring
Andi Peng
Besmira Nushi
Emre Kıcıman
K. Inkpen
Ece Kamar
51
32
0
21 Feb 2022
Learning Synthetic Environments and Reward Networks for Reinforcement
  Learning
Learning Synthetic Environments and Reward Networks for Reinforcement Learning
Fabio Ferreira
Thomas Nierhoff
Andreas Saelinger
Frank Hutter
43
4
0
06 Feb 2022
Meta-Reinforcement Learning with Self-Modifying Networks
Meta-Reinforcement Learning with Self-Modifying Networks
Mathieu Chalvidal
Thomas Serre
Rufin VanRullen
KELM
87
5
0
04 Feb 2022
Bayesian sense of time in biological and artificial brains
Bayesian sense of time in biological and artificial brains
Zafeirios Fountas
Alexey Zakharov
62
0
0
14 Jan 2022
Direct Mutation and Crossover in Genetic Algorithms Applied to
  Reinforcement Learning Tasks
Direct Mutation and Crossover in Genetic Algorithms Applied to Reinforcement Learning Tasks
Tarek Faycal
Claudio Zito
26
2
0
13 Jan 2022
Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees
Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees
Tarek Faycal
Claudio Zito
OffRL
29
2
0
12 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
116
107
0
11 Jan 2022
Learning Cooperative Multi-Agent Policies with Partial Reward Decoupling
Learning Cooperative Multi-Agent Policies with Partial Reward Decoupling
B. Freed
Aditya Kapoor
Ian Abraham
J. Schneider
Howie Choset
78
5
0
23 Dec 2021
Graph augmented Deep Reinforcement Learning in the GameRLand3D
  environment
Graph augmented Deep Reinforcement Learning in the GameRLand3D environment
E. Beeching
Maxim Peter
Philippe Marcotte
Jilles Debangoye
Olivier Simonin
Joshua Romoff
Christian Wolf
91
5
0
22 Dec 2021
Maximum Entropy Population-Based Training for Zero-Shot Human-AI
  Coordination
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
95
69
0
22 Dec 2021
The Partially Observable Asynchronous Multi-Agent Cooperation Challenge
The Partially Observable Asynchronous Multi-Agent Cooperation Challenge
Meng Yao
Qiyue Yin
Jun Yang
Tongtong Yu
S. Shen
Junge Zhang
Bin Liang
Kaiqi Huang
43
5
0
07 Dec 2021
The Power of Communication in a Distributed Multi-Agent System
The Power of Communication in a Distributed Multi-Agent System
P. D. Siedler
53
3
0
30 Nov 2021
Collective Intelligence for Deep Learning: A Survey of Recent
  Developments
Collective Intelligence for Deep Learning: A Survey of Recent Developments
David R Ha
Yu Tang
AI4CE
127
70
0
29 Nov 2021
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
Qiyue Yin
Jun Yang
Kaiqi Huang
Meijing Zhao
Wancheng Ni
Bin Liang
Yan Huang
Shu Wu
Liangsheng Wang
61
21
0
15 Nov 2021
Towards convergence to Nash equilibria in two-team zero-sum games
Towards convergence to Nash equilibria in two-team zero-sum games
Fivos Kalogiannis
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
80
5
0
07 Nov 2021
Development of collective behavior in newborn artificial agents
Development of collective behavior in newborn artificial agents
Donsuk Lee
Samantha M. W. Wood
Justin N. Wood
49
2
0
06 Nov 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Rujikorn Charakorn
P. Manoonpong
Nat Dilokthanakul
85
5
0
05 Nov 2021
Learning Diverse Policies in MOBA Games via Macro-Goals
Learning Diverse Policies in MOBA Games via Macro-Goals
Yiming Gao
Bei Shi
Xueying Du
Liang Wang
Guangwei Chen
...
Weixuan Wang
Deheng Ye
Qiang Fu
Wei Yang
Lanxiao Huang
76
11
0
27 Oct 2021
Collaborating with Humans without Human Data
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
184
171
0
15 Oct 2021
Effects of Different Optimization Formulations in Evolutionary
  Reinforcement Learning on Diverse Behavior Generation
Effects of Different Optimization Formulations in Evolutionary Reinforcement Learning on Diverse Behavior Generation
Victor Villin
Naoki Masuyama
Yusuke Nojima
82
2
0
15 Oct 2021
The Neural MMO Platform for Massively Multiagent Research
The Neural MMO Platform for Massively Multiagent Research
Joseph Suárez
Yilun Du
Clare Zhu
Igor Mordatch
Phillip Isola
AI4CE
90
25
0
14 Oct 2021
Learning Temporally-Consistent Representations for Data-Efficient
  Reinforcement Learning
Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
Trevor A. McInroe
Lukas Schafer
Stefano V. Albrecht
OffRL
62
8
0
11 Oct 2021
Pick Your Battles: Interaction Graphs as Population-Level Objectives for
  Strategic Diversity
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity
M. Garnelo
Wojciech M. Czarnecki
Siqi Liu
Dhruva Tirumala
Junhyuk Oh
Gauthier Gidel
H. V. Hasselt
David Balduzzi
114
25
0
08 Oct 2021
Genealogical Population-Based Training for Hyperparameter Optimization
Genealogical Population-Based Training for Hyperparameter Optimization
Antoine Scardigli
P. Fournier
Matteo Vilucchio
D. Naccache
GP
31
0
0
30 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
135
60
0
28 Sep 2021
Faster Improvement Rate Population Based Training
Faster Improvement Rate Population Based Training
Valentin Dalibard
Max Jaderberg
67
13
0
28 Sep 2021
Multi-Agent Embodied Visual Semantic Navigation with Scene Prior
  Knowledge
Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge
Xinzhu Liu
Di Guo
Huaping Liu
F. Sun
EgoV
77
25
0
20 Sep 2021
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via
  Convex Relaxation
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Chuangchuang Sun
Dong-Ki Kim
Jonathan P. How
AAML
92
19
0
14 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning
  Algorithms
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
61
3
0
04 Sep 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
66
7
0
16 Aug 2021
Bridging the gap between emotion and joint action
Bridging the gap between emotion and joint action
M. Bieńkiewicz
Andrii Smykovskyi
Temitayo A. Olugbade
Stefan Janaqi
A. Camurri
N. Bianchi-Berthouze
Mårten Björkman
B. Bardy
50
22
0
13 Aug 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
130
190
0
27 Jul 2021
Megaverse: Simulating Embodied Agents at One Million Experiences per
  Second
Megaverse: Simulating Embodied Agents at One Million Experiences per Second
Aleksei Petrenko
Erik Wijmans
Brennan Shacklett
V. Koltun
LM&RoVGen
88
24
0
17 Jul 2021
Level generation and style enhancement -- deep learning for game
  development overview
Level generation and style enhancement -- deep learning for game development overview
P. Migdal
Bartlomiej Olechno
Bla.zej Podgórski
GANVLM
39
4
0
15 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting
  Pot
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
93
111
0
14 Jul 2021
Recent Advances in Leveraging Human Guidance for Sequential
  Decision-Making Tasks
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
158
29
0
13 Jul 2021
Previous
12345678
Next