ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.01281
  4. Cited By
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
    OffRL
ArXivPDFHTML

Papers citing "Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"

50 / 139 papers shown
Title
Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors
Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors
Lang Feng
Jiahao Lin
Dong Xing
Li Zhang
De Ma
Gang Pan
21
0
0
16 May 2025
Diffusion Stochastic Learning Over Adaptive Competing Networks
Diffusion Stochastic Learning Over Adaptive Competing Networks
Yike Zhao
H. Cai
Ali H. Sayed
DiffM
35
0
0
28 Apr 2025
ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies
ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies
Pedro Sequeira
Vidyasagar Sadhu
Melinda Gervasio
DiffM
89
0
0
25 Feb 2025
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors
Niels Justesen
Maria Kaselimi
Sam Snodgrass
Miruna Vozaru
Matthew Schlegel
...
Albert Wang
Christoffer Holmgård
Georgios N. Yannakakis
S. Risi
Julian Togelius
50
0
0
03 Jan 2025
GPT for Games: An Updated Scoping Review (2020-2024)
GPT for Games: An Updated Scoping Review (2020-2024)
Daijin Yang
Erica Kleinman
Casper Harteveld
LLMAG
AI4TS
AI4CE
48
3
0
01 Nov 2024
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games
Pranav Rajbhandari
Prithviraj Dasgupta
D. Sofge
21
0
0
17 Oct 2024
Training Interactive Agent in Large FPS Game Map with Rule-enhanced
  Reinforcement Learning
Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning
Chen Zhang
Huan Hu
Yuan Zhou
Qiyang Cao
Ruochen Liu
Wenya Wei
Elvis S. Liu
AI4CE
26
0
0
07 Oct 2024
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
60
8
0
02 Aug 2024
Mimicry and the Emergence of Cooperative Communication
Mimicry and the Emergence of Cooperative Communication
Dylan R. Cope
Peter McBurney
35
0
0
26 May 2024
A social path to human-like artificial intelligence
A social path to human-like artificial intelligence
Edgar A. Duénez-Guzmán
Suzanne Sadedin
Jane X. Wang
Kevin R. McKee
Joel Z Leibo
GNN
31
28
0
22 May 2024
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation
Hongxin Zhang
Zeyuan Wang
Qiushi Lyu
Zheyuan Zhang
Sunli Chen
Tianmin Shu
Yilun Du
Kwonjoon Lee
Yilun Du
Chuang Gan
48
12
0
16 Apr 2024
Effective Reinforcement Learning Based on Structural Information
  Principles
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
40
0
0
15 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in
  Reinforcement Learning
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
53
4
0
12 Apr 2024
Scaling Artificial Intelligence for Digital Wargaming in Support of
  Decision-Making
Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making
Scotty Black
Christian J. Darken
16
2
0
08 Feb 2024
Social Interpretable Reinforcement Learning
Social Interpretable Reinforcement Learning
Leonardo Lucio Custode
Giovanni Iacca
OffRL
42
2
0
27 Jan 2024
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
37
1
0
12 Dec 2023
Reward Shaping for Improved Learning in Real-time Strategy Game Play
Reward Shaping for Improved Learning in Real-time Strategy Game Play
John Kliem
Prithviraj Dasgupta
OffRL
19
1
0
27 Nov 2023
Emergence of Collective Open-Ended Exploration from Decentralized
  Meta-Reinforcement Learning
Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning
Richard Bornemann
Gautier Hamon
Eleni Nisioti
Clément Moulin-Frier
LRM
27
1
0
01 Nov 2023
Iteratively Learn Diverse Strategies with State Distance Information
Iteratively Learn Diverse Strategies with State Distance Information
Wei Fu
Weihua Du
Jingwei Li
Sunli Chen
Jingzhao Zhang
Yi Wu
51
3
0
23 Oct 2023
Avalon's Game of Thoughts: Battle Against Deception through Recursive
  Contemplation
Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Shenzhi Wang
Chang Liu
Zilong Zheng
Siyuan Qi
Shuo Chen
Qisen Yang
Andrew Zhao
Chaofei Wang
Shiji Song
Gao Huang
LLMAG
37
63
0
02 Oct 2023
MatrixWorld: A pursuit-evasion platform for safe multi-agent
  coordination and autocurricula
MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula
Lijun Sun
Yu-Cheng Chang
Chao Lyu
Chin-Teng Lin
Yuhui Shi
38
1
0
27 Jul 2023
Building Cooperative Embodied Agents Modularly with Large Language Models
Building Cooperative Embodied Agents Modularly with Large Language Models
Hongxin Zhang
Weihua Du
Jiaming Shan
Qinhong Zhou
Yilun Du
J. Tenenbaum
Tianmin Shu
Chuang Gan
LLMAG
LM&Ro
59
157
0
05 Jul 2023
Decision-Oriented Dialogue for Human-AI Collaboration
Decision-Oriented Dialogue for Human-AI Collaboration
Jessy Lin
Nicholas Tomlin
Jacob Andreas
J. Eisner
LLMAG
35
27
0
31 May 2023
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Jie Song
Mingli Song
34
18
0
27 May 2023
Robust multi-agent coordination via evolutionary generation of auxiliary
  adversarial attackers
Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers
Lei Yuan
Zifei Zhang
Ke Xue
Hao Yin
F. Chen
Cong Guan
Lihe Li
Chao Qian
Yang Yu
AAML
24
17
0
10 May 2023
Towards Effective and Interpretable Human-Agent Collaboration in MOBA
  Games: A Communication Perspective
Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Weixuan Wang
...
Jiawei Wang
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
42
6
0
23 Apr 2023
Mastering Asymmetrical Multiplayer Game with Multi-Agent
  Asymmetric-Evolution Reinforcement Learning
Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning
Chenglu Sun
Yi-cui Zhang
Yu Zhang
Ziling Lu
Jingbin Liu
Si-Qi Xu
Weidong Zhang
25
0
0
20 Apr 2023
Cooperative Coevolution for Non-Separable Large-Scale Black-Box
  Optimization: Convergence Analyses and Distributed Accelerations
Cooperative Coevolution for Non-Separable Large-Scale Black-Box Optimization: Convergence Analyses and Distributed Accelerations
Qiqi Duan
Chang Shao
Guochen Zhou
Hao Yang
Qi Zhao
Yuhui Shi
25
4
0
11 Apr 2023
Fast exploration and learning of latent graphs with aliased observations
Fast exploration and learning of latent graphs with aliased observations
Miguel Lazaro-Gredilla
Ishani Deshpande
Siva K. Swaminathan
Meet Dave
Dileep George
23
3
0
13 Mar 2023
Exposure-Based Multi-Agent Inspection of a Tumbling Target Using Deep
  Reinforcement Learning
Exposure-Based Multi-Agent Inspection of a Tumbling Target Using Deep Reinforcement Learning
Joshua Aurand
Steven Cutlip
Henry Lei
Kendra A. Lang
S. Phillips
29
6
0
27 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for
  Reinforcement Learning through Value-Decomposition
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
26
2
0
02 Feb 2023
Learning Roles with Emergent Social Value Orientations
Learning Roles with Emergent Social Value Orientations
Wenhao Li
Xiangfeng Wang
Bo Jin
J. Lu
H. Zha
18
3
0
31 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
38
109
0
18 Jan 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time
  Multi-Robot Cooperative Exploration
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
33
36
0
09 Jan 2023
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Sagnik Majumder
Hao Jiang
Pierre Moulon
E. Henderson
P. Calamia
Kristen Grauman
V. Ithapu
EgoV
35
7
0
04 Jan 2023
Ithaca. A Tool for Integrating Fuzzy Logic in Unity
Ithaca. A Tool for Integrating Fuzzy Logic in Unity
Alfonso Tejedor Moreno
J. A. Piedra-Fernandez
J. J. Ojeda-Castelo
L. Iribarne
23
0
0
01 Jan 2023
Towards automating Codenames spymasters with deep reinforcement learning
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
22
2
0
28 Dec 2022
Learning Latent Representations to Co-Adapt to Humans
Learning Latent Representations to Co-Adapt to Humans
Sagar Parekh
Dylan P. Losey
20
12
0
19 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player
  Multi-Agent Learning Toolbox
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
28
13
0
01 Dec 2022
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
35
31
0
24 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from
  Cooperation to Team Competition
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
25
3
0
21 Nov 2022
Group Cohesion in Multi-Agent Scenarios as an Emergent Behavior
Group Cohesion in Multi-Agent Scenarios as an Emergent Behavior
Gianluca Georg Alois Volkmer
Nabil Alsabah
AI4CE
18
0
0
03 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
27
0
0
02 Nov 2022
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
T. Langerak
Sammy Christen
Mert Albaba
Christoph Gebhardt
Otmar Hilliges
OffRL
17
0
0
26 Sep 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns
  for Cross-Domain Adaptation
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
32
3
0
24 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary
  Reinforcement Learning towards Asynchronous Commercial Games
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
34
5
0
21 Sep 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial
  Intelligence with Humans
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELM
AILaw
88
27
0
14 Sep 2022
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations
  Among Team Members
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members
Daphne Cornelisse
Thomas Rood
Mateusz Malinowski
Yoram Bachrach
Tal Kachman
35
10
0
18 Aug 2022
123
Next