ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.01281
  4. Cited By
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
    OffRL
ArXivPDFHTML

Papers citing "Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"

41 / 141 papers shown
Title
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture
  using Feedback-Modulated Delay Loops
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture using Feedback-Modulated Delay Loops
Florian Stelzer
André Röhm
Raul Vicente
Ingo Fischer
University of Tartu
AI4CE
19
46
0
19 Nov 2020
Phoebe: Reuse-Aware Online Caching with Reinforcement Learning for
  Emerging Storage Models
Phoebe: Reuse-Aware Online Caching with Reinforcement Learning for Emerging Storage Models
Nan Wu
Pengcheng Li
17
7
0
13 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social
  Preferences
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Bowen Baker
LRM
18
33
0
10 Nov 2020
Playing optical tweezers with deep reinforcement learning: in virtual,
  physical and augmented environments
Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments
M. Praeger
Yunhui Xie
J. Grant-Jacob
R. Eason
B. Mills
22
11
0
05 Nov 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
16
173
0
05 Nov 2020
Meta-trained agents implement Bayes-optimal agents
Meta-trained agents implement Bayes-optimal agents
Vladimir Mikulik
Grégoire Delétang
Tom McGrath
Tim Genewein
Miljan Martic
Shane Legg
Pedro A. Ortega
OOD
FedML
35
40
0
21 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
30
39
0
04 Oct 2020
Learning to Play against Any Mixture of Opponents
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
33
9
0
29 Sep 2020
A community-powered search of machine learning strategy space to find
  NMR property prediction models
A community-powered search of machine learning strategy space to find NMR property prediction models
Lars A. Bratholm
W. Gerrard
Brandon M. Anderson
Shaojie Bai
Sunghwan Choi
...
A. Torrubia
Devin Willmott
C. Butts
David R. Glowacki
Kaggle participants
21
16
0
13 Aug 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
35
174
0
24 Jul 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke
Joshua Achiam
Pieter Abbeel
31
286
0
08 Jul 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
19
42
0
08 Jun 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax
  Policies
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies
Stephan Zheng
Alexander R. Trott
Sunil Srinivasa
Nikhil Naik
Melvin Gruesbeck
David C. Parkes
R. Socher
31
131
0
28 Apr 2020
Meta-Learning in Neural Networks: A Survey
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
82
1,935
0
11 Apr 2020
How Do You Act? An Empirical Study to Understand Behavior of Deep
  Reinforcement Learning Agents
How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents
Richard Meyes
Moritz Schneider
Tobias Meisen
28
2
0
07 Apr 2020
Fiber: A Platform for Efficient Development and Distributed Training for
  Reinforcement Learning and Population-Based Methods
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
Jiale Zhi
Rui Wang
Jeff Clune
Kenneth O. Stanley
OffRL
30
12
0
25 Mar 2020
Decentralized MCTS via Learned Teammate Models
Decentralized MCTS via Learned Teammate Models
A. Czechowski
F. Oliehoek
209
19
0
19 Mar 2020
FormulaZero: Distributionally Robust Online Adaptation via Offline
  Population Synthesis
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis
Aman Sinha
Matthew O'Kelly
Hongrui Zheng
Rahul Mangharam
John C. Duchi
Russ Tedrake
OffRL
66
26
0
09 Mar 2020
Computer-inspired Quantum Experiments
Computer-inspired Quantum Experiments
Mario Krenn
Manuel Erhard
A. Zeilinger
27
73
0
23 Feb 2020
Provably Efficient Online Hyperparameter Optimization with
  Population-Based Bandits
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
75
83
0
06 Feb 2020
Social diversity and social preferences in mixed-motive reinforcement
  learning
Social diversity and social preferences in mixed-motive reinforcement learning
Kevin R. McKee
I. Gemp
Brian McWilliams
Edgar A. Duénez-Guzmán
Edward Hughes
Joel Z Leibo
20
80
0
06 Feb 2020
Variational Recurrent Models for Solving Partially Observable Control
  Tasks
Variational Recurrent Models for Solving Partially Observable Control Tasks
Dongqi Han
Kenji Doya
Jun Tani
DRL
OffRL
21
59
0
23 Dec 2019
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse
  Rewards
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards
Xingyu Lu
Stas Tiomkin
Pieter Abbeel
OffRL
31
4
0
21 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
46
1,795
0
13 Dec 2019
On the Utility of Learning about Humans for Human-AI Coordination
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
S. Seshia
Pieter Abbeel
Anca Dragan
HAI
11
380
0
13 Oct 2019
A Generalized Training Approach for Multiagent Learning
A Generalized Training Approach for Multiagent Learning
Paul Muller
Shayegan Omidshafiei
Mark Rowland
K. Tuyls
Julien Perolat
...
Zhe Wang
Guy Lever
N. Heess
T. Graepel
Rémi Munos
22
89
0
27 Sep 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete
  and Continuous Control
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
37
121
0
26 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
Yuchen Lu
Steven Bocco
Max O. Smith
Satya Ortiz-Gagné
Jonathan K. Kummerfeld
Satinder Singh
Joelle Pineau
Aaron Courville
33
57
0
04 Sep 2019
Iterative Update and Unified Representation for Multi-Agent
  Reinforcement Learning
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning
Jiancheng Long
Hongming Zhang
Tianyang Yu
Bo Xu
13
0
0
16 Aug 2019
Arena: a toolkit for Multi-Agent Reinforcement Learning
Arena: a toolkit for Multi-Agent Reinforcement Learning
Qing Wang
Jiechao Xiong
Lei Han
Meng Fang
Xinghai Sun
Zhuobin Zheng
Peng Sun
Zhengyou Zhang
31
4
0
20 Jul 2019
ORRB -- OpenAI Remote Rendering Backend
ORRB -- OpenAI Remote Rendering Backend
Maciek Chociej
Peter Welinder
Lilian Weng
AI4CE
21
10
0
26 Jun 2019
Arena: A General Evaluation Platform and Building Toolkit for
  Multi-Agent Intelligence
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
Yuhang Song
Andrzej Wojcicki
Thomas Lukasiewicz
Jianyi Wang
Abi Aryan
Zhenghua Xu
Mai Xu
Zihan Ding
Lianlong Wu
AI4CE
ELM
27
33
0
17 May 2019
A Conceptual Bio-Inspired Framework for the Evolution of Artificial
  General Intelligence
A Conceptual Bio-Inspired Framework for the Evolution of Artificial General Intelligence
S. Pontes-Filho
Stefano Nichele
AI4CE
19
6
0
25 Mar 2019
Neural MMO: A Massively Multiagent Game Environment for Training and
  Evaluating Intelligent Agents
Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents
Joseph Suárez
Yilun Du
Phillip Isola
Igor Mordatch
11
71
0
02 Mar 2019
Making AI meaningful again
Making AI meaningful again
Jobst Landgrebe
Barry F. Smith
16
35
0
09 Jan 2019
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
34
396
0
19 Nov 2018
Evolving intrinsic motivations for altruistic behavior
Evolving intrinsic motivations for altruistic behavior
Jane X. Wang
Edward Hughes
Chrisantha Fernando
Wojciech M. Czarnecki
Edgar A. Duénez-Guzmán
Joel Z Leibo
21
76
0
14 Nov 2018
TarMAC: Targeted Multi-Agent Communication
TarMAC: Targeted Multi-Agent Communication
Abhishek Das
Théophile Gervet
Joshua Romoff
Dhruv Batra
Devi Parikh
Michael G. Rabbat
Joelle Pineau
22
378
0
26 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
32
550
0
12 Oct 2018
CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement
  Learning
CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning
Jiachen Yang
A. Nakhaei
David Isele
K. Fujimura
H. Zha
29
75
0
13 Sep 2018
Modeling the Formation of Social Conventions from Embodied Real-Time
  Interactions
Modeling the Formation of Social Conventions from Embodied Real-Time Interactions
Ismael T. Freire
Clément Moulin-Frier
Martí Sánchez-Fibla
X. Arsiwalla
P. Verschure
23
15
0
16 Feb 2018
Previous
123