ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.01281
  4. Cited By
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"

50 / 363 papers shown
Title
Using Unity to Help Solve Intelligence
Using Unity to Help Solve Intelligence
Tom Ward
Andrew Bolt
Nik Hemmings
Simon Carter
Manuel Sanchez
...
Jay Lemmon
J. Coe
Piotr Trochim
T. Handley
Adrian Bolton
66
18
0
18 Nov 2020
Phoebe: Reuse-Aware Online Caching with Reinforcement Learning for
  Emerging Storage Models
Phoebe: Reuse-Aware Online Caching with Reinforcement Learning for Emerging Storage Models
Nan Wu
Pengcheng Li
53
7
0
13 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social
  Preferences
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Bowen Baker
LRM
69
37
0
10 Nov 2020
Playing optical tweezers with deep reinforcement learning: in virtual,
  physical and augmented environments
Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments
M. Praeger
Yunhui Xie
J. Grant-Jacob
R. Eason
B. Mills
64
12
0
05 Nov 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
101
180
0
05 Nov 2020
Meta-trained agents implement Bayes-optimal agents
Meta-trained agents implement Bayes-optimal agents
Vladimir Mikulik
Grégoire Delétang
Tom McGrath
Tim Genewein
Miljan Martic
Shane Legg
Pedro A. Ortega
OODFedML
88
43
0
21 Oct 2020
Negotiating Team Formation Using Deep Reinforcement Learning
Negotiating Team Formation Using Deep Reinforcement Learning
Yoram Bachrach
Richard Everett
Edward Hughes
Angeliki Lazaridou
Joel Z Leibo
Marc Lanctot
Michael Bradley Johanson
Wojciech M. Czarnecki
T. Graepel
109
36
0
20 Oct 2020
Watch-And-Help: A Challenge for Social Perception and Human-AI
  Collaboration
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration
Xavier Puig
Tianmin Shu
Shuang Li
Zilin Wang
Yuan-Hong Liao
J. Tenenbaum
Sanja Fidler
Antonio Torralba
LM&Ro
163
130
0
19 Oct 2020
Multi-Agent Collaboration via Reward Attribution Decomposition
Multi-Agent Collaboration via Reward Attribution Decomposition
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
73
40
0
16 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
118
211
0
04 Oct 2020
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for
  Physically Embedded 3D Sokoban
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Peter Karkus
M. Berk Mirza
A. Guez
Andrew Jaegle
Timothy Lillicrap
Lars Buesing
N. Heess
T. Weber
OffRL
67
8
0
03 Oct 2020
Learning to Play against Any Mixture of Opponents
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
75
9
0
29 Sep 2020
Symbolic Relational Deep Reinforcement Learning based on Graph Neural
  Networks and Autoregressive Policy Decomposition
Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks and Autoregressive Policy Decomposition
Jaromír Janisch
Tomávs Pevný
Viliam Lisý
AI4CE
91
3
0
25 Sep 2020
Decoupling Representation Learning from Reinforcement Learning
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSLDRL
380
346
0
14 Sep 2020
Efficient Competitive Self-Play Policy Optimization
Efficient Competitive Self-Play Policy Optimization
Yuanyi Zhong
Yuanshuo Zhou
Jian Peng
13
2
0
13 Sep 2020
RLCFR: Minimize Counterfactual Regret by Deep Reinforcement Learning
RLCFR: Minimize Counterfactual Regret by Deep Reinforcement Learning
Huale Li
Xinyu Wang
Fengwei Jia
Yifan Li
Yulin Wu
Jia-jia Zhang
Shuhan Qi
29
6
0
10 Sep 2020
A community-powered search of machine learning strategy space to find
  NMR property prediction models
A community-powered search of machine learning strategy space to find NMR property prediction models
Lars A. Bratholm
W. Gerrard
Brandon M. Anderson
Shaojie Bai
Sunghwan Choi
...
A. Torrubia
Devin Willmott
C. Butts
David R. Glowacki
Kaggle participants
46
17
0
13 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a
  Survey
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDLOffRL
127
17
0
11 Aug 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
102
181
0
24 Jul 2020
Dynamic Relational Inference in Multi-Agent Trajectories
Dynamic Relational Inference in Multi-Agent Trajectories
Ruichao Xiao
Manish Kumar Singh
Rose Yu
125
2
0
16 Jul 2020
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied
  Tasks
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks
Unnat Jain
Luca Weihs
Eric Kolve
Ali Farhadi
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
86
58
0
09 Jul 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke
Joshua Achiam
Pieter Abbeel
112
302
0
08 Jul 2020
Deep Reinforcement Learning and its Neuroscientific Implications
Deep Reinforcement Learning and its Neuroscientific Implications
M. Botvinick
Jane X. Wang
Will Dabney
Kevin J. Miller
Z. Kurth-Nelson
OffRLAI4CE
97
176
0
07 Jul 2020
Scaling Imitation Learning in Minecraft
Scaling Imitation Learning in Minecraft
Artemij Amiranashvili
Nicolai Dorka
Wolfram Burgard
V. Koltun
Thomas Brox
MLAU
53
15
0
06 Jul 2020
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Adam Stooke
Valentin Dalibard
Siddhant M. Jayakumar
Wojciech M. Czarnecki
Max Jaderberg
53
1
0
26 Jun 2020
A Benchmarking Framework for Interactive 3D Applications in the Cloud
A Benchmarking Framework for Interactive 3D Applications in the Cloud
Tianyi Liu
Sen He
Sunzhou Huang
Danny H. K. Tsang
Lingjia Tang
Jason Mars
Wei Wang
18
11
0
23 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with
  Asynchronous Reinforcement Learning
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
103
105
0
21 Jun 2020
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash
  Equilibria in Large Games
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen Marcus McAleer
John Lanier
Roy Fox
Pierre Baldi
63
78
0
15 Jun 2020
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
150
170
0
12 Jun 2020
Learning to Incentivize Other Learning Agents
Learning to Incentivize Other Learning Agents
Jiachen Yang
Ang Li
Mehrdad Farajtabar
P. Sunehag
Edward Hughes
H. Zha
102
70
0
10 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
106
43
0
08 Jun 2020
A Comparison of Self-Play Algorithms Under a Generalized Framework
A Comparison of Self-Play Algorithms Under a Generalized Framework
Daniel Hernández
Kevin Denamganai
Sam Devlin
Spyridon Samothrakis
James Alfred Walker
61
12
0
08 Jun 2020
Causality and Batch Reinforcement Learning: Complementary Approaches To
  Planning In Unknown Domains
Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains
James Bannon
Bradford T. Windsor
Wenbo Song
Tao Li
CMLOODOffRL
73
20
0
03 Jun 2020
Manipulating the Distributions of Experience used for Self-Play Learning
  in Expert Iteration
Manipulating the Distributions of Experience used for Self-Play Learning in Expert Iteration
Dennis J. N. J. Soemers
Éric Piette
Matthew Stephenson
C. Browne
OffRL
72
6
0
30 May 2020
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement
  Learning Problems
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems
Alexis Asseman
Nicolas Antoine
A. Ozcan
29
4
0
10 May 2020
Navigating the Landscape of Multiplayer Games
Navigating the Landscape of Multiplayer Games
Shayegan Omidshafiei
K. Tuyls
Wojciech M. Czarnecki
Francisco C. Santos
Mark Rowland
...
Paul Muller
Julien Perolat
Bart De Vylder
A. Gruslys
Rémi Munos
32
2
0
04 May 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax
  Policies
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies
Stephan Zheng
Alexander R. Trott
Sunil Srinivasa
Nikhil Naik
Melvin Gruesbeck
David C. Parkes
R. Socher
60
136
0
28 Apr 2020
Real World Games Look Like Spinning Tops
Real World Games Look Like Spinning Tops
Wojciech M. Czarnecki
Gauthier Gidel
Brendan D. Tracey
K. Tuyls
Shayegan Omidshafiei
David Balduzzi
Max Jaderberg
75
101
0
20 Apr 2020
Energy-Based Imitation Learning
Energy-Based Imitation Learning
Minghuan Liu
Tairan He
Minkai Xu
Weinan Zhang
118
48
0
20 Apr 2020
Meta-Learning in Neural Networks: A Survey
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
420
2,005
0
11 Apr 2020
CURL: Contrastive Unsupervised Representations for Reinforcement
  Learning
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSLDRLOffRL
148
1,097
0
08 Apr 2020
How Do You Act? An Empirical Study to Understand Behavior of Deep
  Reinforcement Learning Agents
How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents
Richard Meyes
Moritz Schneider
Tobias Meisen
55
2
0
07 Apr 2020
Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward
  Network Goes with Reinforcement Learning
Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward Network Goes with Reinforcement Learning
Marco Pleines
J. Jitsev
Mike Preuss
Frank Zimmer
49
2
0
01 Apr 2020
Fiber: A Platform for Efficient Development and Distributed Training for
  Reinforcement Learning and Population-Based Methods
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
Jiale Zhi
Rui Wang
Jeff Clune
Kenneth O. Stanley
OffRL
72
12
0
25 Mar 2020
Multi-Agent Reinforcement Learning for Problems with Combined Individual
  and Team Reward
Multi-Agent Reinforcement Learning for Problems with Combined Individual and Team Reward
Hassam Sheikh
Ladislau Bölöni
105
37
0
24 Mar 2020
Decentralized MCTS via Learned Teammate Models
Decentralized MCTS via Learned Teammate Models
A. Czechowski
F. Oliehoek
451
19
0
19 Mar 2020
Accelerating and Improving AlphaZero Using Population Based Training
Accelerating and Improving AlphaZero Using Population Based Training
Ti-Rong Wu
Ting Han Wei
I-Chen Wu
41
17
0
13 Mar 2020
Human AI interaction loop training: New approach for interactive
  reinforcement learning
Human AI interaction loop training: New approach for interactive reinforcement learning
N. Navidi
41
6
0
09 Mar 2020
FormulaZero: Distributionally Robust Online Adaptation via Offline
  Population Synthesis
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis
Aman Sinha
Matthew O'Kelly
Hongrui Zheng
Rahul Mangharam
John C. Duchi
Russ Tedrake
OffRL
132
27
0
09 Mar 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated
  Environments
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments
Roberta Raileanu
Tim Rocktaschel
88
174
0
27 Feb 2020
Previous
12345678
Next