ResearchTrend.AI

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
    OffRL

Papers citing "Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"

50 / 363 papers shown
From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI
S. Risi
Mike Preuss
82
57
0
24 Feb 2020
Computer-inspired Quantum Experiments
Mario Krenn
Manuel Erhard
A. Zeilinger
72
75
0
23 Feb 2020
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Victor Campos
Alexander R. Trott
Caiming Xiong
R. Socher
Xavier Giró-i-Nieto
Jordi Torres
OffRL
101
156
0
10 Feb 2020
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
161
86
0
06 Feb 2020
Social diversity and social preferences in mixed-motive reinforcement learning
Kevin R. McKee
I. Gemp
Brian McWilliams
Edgar A. Duéñez-Guzmán
Edward Hughes
Joel Z Leibo
97
85
0
06 Feb 2020
Soft Hindsight Experience Replay
Qiwei He
Liansheng Zhuang
Houqiang Li
61
9
0
06 Feb 2020
SoundSpaces: Audio-Visual Navigation in 3D Environments
Changan Chen
Unnat Jain
Carl Schissler
S. V. A. Garí
Ziad Al-Halah
V. Ithapu
Philip Robinson
Kristen Grauman
108
26
0
24 Dec 2019
Variational Recurrent Models for Solving Partially Observable Control Tasks
Dongqi Han
Kenji Doya
Jun Tani
DRL, OffRL
72
63
0
23 Dec 2019
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards
Xingyu Lu
Stas Tiomkin
Pieter Abbeel
OffRL
72
5
0
21 Dec 2019
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Deheng Ye
Zhao Liu
Mingfei Sun
Bei Shi
P. Zhao
...
Tengfei Shi
Liang Wang
Qiang Fu
Wei Yang
Lanxiao Huang
67
324
0
20 Dec 2019
Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning
A. Celli
Marco Ciccone
Raffaele Bongo
N. Gatti
61
12
0
16 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN, VLM, CLL, AI4CE, LRM
181
1,840
0
13 Dec 2019
Increasing Generality in Machine Learning through Procedural Content Generation
S. Risi
Julian Togelius
67
127
0
29 Nov 2019
A novel method for identifying the deep neural network model with the Serial Number
Xiangrui Xu
Yaqin Li
Cao Yuan
AAML
41
8
0
19 Nov 2019
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning
Xinghu Yao
Chao Wen
Yuhui Wang
Xiaoyang Tan
107
47
0
11 Nov 2019
Adversarial Language Games for Advanced Natural Language Intelligence
Yuan Yao
Haoxiang Zhong
Zhengyan Zhang
Xu Han
Xiaozhi Wang
Chaojun Xiao
Guoyang Zeng
Zhiyuan Liu
Maosong Sun
AAML
71
7
0
05 Nov 2019
Visual Hide and Seek
Boyuan Chen
Shuran Song
Hod Lipson
Carl Vondrick
71
22
0
15 Oct 2019
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
110
368
0
13 Oct 2019
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
Sanjit A. Seshia
Pieter Abbeel
Anca Dragan
HAI
80
405
0
13 Oct 2019
Improving Generalization in Meta Reinforcement Learning using Learned Objectives
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
OffRL
95
119
0
09 Oct 2019
Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning
Matthias Hutsebaut-Buysse
Kevin Mets
Steven Latré
26
5
0
09 Oct 2019
TorchBeast: A PyTorch Platform for Distributed RL
Heinrich Küttler
Nantas Nardelli
Thibaut Lavril
Marco Selvatici
V. Sivakumar
Tim Rocktäschel
Edward Grefenstette
OffRL
94
58
0
08 Oct 2019
Automated curricula through setter-solver interactions
S. Racanière
Andrew Kyle Lampinen
Adam Santoro
David P. Reichert
Vlad Firoiu
Timothy Lillicrap
81
53
0
27 Sep 2019
A Generalized Training Approach for Multiagent Learning
Paul Muller
Shayegan Omidshafiei
Mark Rowland
K. Tuyls
Julien Perolat
...
Zhe Wang
Guy Lever
N. Heess
T. Graepel
Rémi Munos
102
94
0
27 Sep 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
116
126
0
26 Sep 2019
Emergent Tool Use From Multi-Agent Autocurricula
Bowen Baker
I. Kanitscheider
Todor Markov
Yi Wu
Glenn Powell
Bob McGrew
Igor Mordatch
LRM
74
658
0
17 Sep 2019
The Animal-AI Environment: Training and Testing Animal-Like Artificial Cognition
Benjamin Beyret
José Hernández-Orallo
Lucy G. Cheke
Marta Halina
Murray Shanahan
Matthew Crosby
91
35
0
12 Sep 2019
Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning
Liheng Chen
Hongyi Guo
Yali Du
Fei Fang
Haifeng Zhang
Yaoming Zhu
Ming Zhou
Weinan Zhang
Qing Wang
Yong Yu
56
8
0
10 Sep 2019
Multi-Objective Multi-Agent Decision Making: A Utility-based Analysis and Survey
Roxana Rădulescu
Patrick Mannion
D. Roijers
A. Nowé
96
146
0
06 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
Yuchen Lu
Steven Bocco
Max O. Smith
Satya Ortiz-Gagné
Jonathan K. Kummerfeld
Satinder Singh
Joelle Pineau
Aaron Courville
88
59
0
04 Sep 2019
Universal Policies to Learn Them All
Hassam Sheikh
Ladislau Bölöni
OffRL
25
1
0
24 Aug 2019
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning
Jiancheng Long
Hongming Zhang
Tianyang Yu
Bo Xu
20
0
0
16 Aug 2019
From Crystallized Adaptivity to Fluid Adaptivity in Deep Reinforcement Learning -- Insights from Biological Systems on Adaptive Flexibility
M. Schilling
Helge J. Ritter
F. Ohl
AI4CE
41
4
0
13 Aug 2019
Improving Deep Reinforcement Learning in Minecraft with Action Advice
Spencer Frazier
Mark O. Riedl
84
29
0
02 Aug 2019
Arena: a toolkit for Multi-Agent Reinforcement Learning
Qing Wang
Jiechao Xiong
Lei Han
Meng Fang
Xinghai Sun
Zhuobin Zheng
Peng Sun
Zhengyou Zhang
59
4
0
20 Jul 2019
Policy-Gradient Algorithms Have No Guarantees of Convergence in Linear Quadratic Games
Eric Mazumdar
Lillian J. Ratliff
Michael I. Jordan
S. Shankar Sastry
92
37
0
08 Jul 2019
Neural Network Verification for the Masses (of AI graduates)
Ekaterina Komendantskaya
Rob Stewart
Kirsy Duncan
Daniel Kienitz
Pierre Le Hen
Pascal Bacchus
21
0
0
02 Jul 2019
MULEX: Disentangling Exploitation from Exploration in Deep RL
Lucas Beyer
Damien Vincent
O. Teboul
Sylvain Gelly
Matthieu Geist
Olivier Pietquin
50
14
0
01 Jul 2019
ORRB -- OpenAI Remote Rendering Backend
Maciek Chociej
Peter Welinder
Lilian Weng
AI4CE
60
11
0
26 Jun 2019
Ease-of-Teaching and Language Structure from Emergent Communication
Fushan Li
Michael Bowling
193
102
0
06 Jun 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
148
122
0
27 May 2019
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
Yuhang Song
Andrzej Wojcicki
Thomas Lukasiewicz
Jianyi Wang
Abi Aryan
Zhenghua Xu
Mai Xu
Zihan Ding
Lianlong Wu
AI4CE, ELM
120
34
0
17 May 2019
A Conceptual Bio-Inspired Framework for the Evolution of Artificial General Intelligence
S. Pontes-Filho
Stefano Nichele
AI4CE
45
6
0
25 Mar 2019
Deep Learning for Cognitive Neuroscience
Katherine R. Storrs
N. Kriegeskorte
NAI, AI4CE
80
46
0
04 Mar 2019
Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents
Joseph Suárez
Yilun Du
Phillip Isola
Igor Mordatch
77
71
0
02 Mar 2019
AlphaStar: An Evolutionary Computation Perspective
Kai Arulkumaran
Antoine Cully
Julian Togelius
91
185
0
05 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
126
355
0
01 Feb 2019
Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks
Dongqi Han
Kenji Doya
Jun Tani
AI4CE
126
20
0
29 Jan 2019
Reward Shaping via Meta-Learning
Haosheng Zou
Zhaolin Ren
Dong Yan
Hang Su
Jun Zhu
OffRL
96
69
0
27 Jan 2019
Making AI meaningful again
Jobst Landgrebe
Barry F. Smith
83
35
0
09 Jan 2019