ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.00506
  4. Cited By
The Hanabi Challenge: A New Frontier for AI Research
v1v2 (latest)

The Hanabi Challenge: A New Frontier for AI Research

1 February 2019
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
H. F. Song
Emilio Parisotto
Vincent Dumoulin
Subhodeep Moitra
Edward Hughes
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
    LLMAG
ArXiv (abs)PDFHTML

Papers citing "The Hanabi Challenge: A New Frontier for AI Research"

50 / 176 papers shown
Title
Guarantees for Self-Play in Multiplayer Games via Polymatrix
  Decomposability
Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability
Revan MacQueen
James R. Wright
81
2
0
17 Oct 2023
Welfare Diplomacy: Benchmarking Language Model Cooperation
Welfare Diplomacy: Benchmarking Language Model Cooperation
Gabriel Mukobi
Hannah Erlebach
Niklas Lauffer
Lewis Hammond
Alan Chan
Jesse Clifton
LM&Ro
92
27
0
13 Oct 2023
Quantifying Agent Interaction in Multi-agent Reinforcement Learning for
  Cost-efficient Generalization
Quantifying Agent Interaction in Multi-agent Reinforcement Learning for Cost-efficient Generalization
Yuxin Chen
Chen Tang
Ran Tian
Chenran Li
Jinning Li
Masayoshi Tomizuka
Wei Zhan
92
3
0
11 Oct 2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed
  Cooperative-Competitive Games
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games
Zelai Xu
Yancheng Liang
Chao Yu
Yu Wang
Yi Wu
93
9
0
05 Oct 2023
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
Saaket Agashe
Yue Fan
Anthony Reyna
Xin Eric Wang
LLMAGLRM
162
16
0
05 Oct 2023
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In
  the Game of Hanabi
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Hadi Nekoei
Xutong Zhao
Janarthanan Rajendran
Miao Liu
Sarath Chandar
50
5
0
20 Aug 2023
Never Explore Repeatedly in Multi-Agent Reinforcement Learning
Never Explore Repeatedly in Multi-Agent Reinforcement Learning
Chenghao Li
Tonghan Wang
Chongjie Zhang
Qianchuan Zhao
74
2
0
19 Aug 2023
Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents
Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents
Arrasy Rahman
Jiaxun Cui
Peter Stone
82
13
0
18 Aug 2023
PyTAG: Challenges and Opportunities for Reinforcement Learning in
  Tabletop Games
PyTAG: Challenges and Opportunities for Reinforcement Learning in Tabletop Games
Martin Balla
G. E. Long
Dominik Jeurissen
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
LMTDOffRLOnRL
87
1
0
19 Jul 2023
Building Cooperative Embodied Agents Modularly with Large Language Models
Building Cooperative Embodied Agents Modularly with Large Language Models
Hongxin Zhang
Weihua Du
Jiaming Shan
Qinhong Zhou
Yilun Du
J. Tenenbaum
Tianmin Shu
Chuang Gan
LLMAGLM&Ro
126
178
0
05 Jul 2023
RaidEnv: Exploring New Challenges in Automated Content Balancing for
  Boss Raid Games
RaidEnv: Exploring New Challenges in Automated Content Balancing for Boss Raid Games
Hyeonchang Jeon
In-Chang Baek
Cheong-mok Bae
Taehwa Park
Wonsang You
Taegwan Ha
Hoyun Jung
Jinha Noh
Seungwon Oh
Kyung-Joong Kim
106
11
0
04 Jul 2023
IMP-MARL: a Suite of Environments for Large-scale Infrastructure
  Management Planning via MARL
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
Pascal Leroy
P. G. Morato
J. Pisane
A. Kolios
D. Ernst
OffRL
100
9
0
20 Jun 2023
Who Needs to Know? Minimal Knowledge for Optimal Coordination
Who Needs to Know? Minimal Knowledge for Optimal Coordination
Niklas Lauffer
Ameesh Shah
Micah Carroll
Michael Dennis
Stuart J. Russell
59
6
0
15 Jun 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Yang Li
Shao Zhang
Jichen Sun
Wenhao Zhang
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
105
17
0
05 Jun 2023
Adaptive Coordination in Social Embodied Rearrangement
Adaptive Coordination in Social Embodied Rearrangement
Andrew Szot
Unnat Jain
Dhruv Batra
Z. Kira
Ruta Desai
Akshara Rai
78
14
0
31 May 2023
Decision-Oriented Dialogue for Human-AI Collaboration
Decision-Oriented Dialogue for Human-AI Collaboration
Jessy Lin
Nicholas Tomlin
Jacob Andreas
J. Eisner
LLMAG
118
28
0
31 May 2023
Strategic Reasoning with Language Models
Strategic Reasoning with Language Models
Kanishk Gandhi
Dorsa Sadigh
Noah D. Goodman
LM&RoLRM
81
41
0
30 May 2023
A Hierarchical Approach to Population Training for Human-AI
  Collaboration
A Hierarchical Approach to Population Training for Human-AI Collaboration
Yi Loo
Chen Gong
Malika Meghjani
57
8
0
26 May 2023
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement
  Learning
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning
Adam Michalski
Filippos Christianos
Stefano V. Albrecht
52
4
0
09 May 2023
Games for Artificial Intelligence Research: A Review and Perspectives
Games for Artificial Intelligence Research: A Review and Perspectives
Chengpeng Hu
Yunlong Zhao
Ziqi Wang
Haocheng Du
Jialin Liu
AI4CE
85
13
0
26 Apr 2023
The Update-Equivalence Framework for Decision-Time Planning
The Update-Equivalence Framework for Decision-Time Planning
Samuel Sokota
Gabriele Farina
David J. Wu
Hengyuan Hu
Kevin A. Wang
J. Zico Kolter
Noam Brown
128
4
0
25 Apr 2023
Language Instructed Reinforcement Learning for Human-AI Coordination
Language Instructed Reinforcement Learning for Human-AI Coordination
Hengyuan Hu
Dorsa Sadigh
LM&Ro
96
64
0
13 Apr 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Language
  Model Society
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society
Ge Li
Hasan Hammoud
Hani Itani
Dmitrii Khizbullin
Guohao Li
SyDaALM
178
521
0
31 Mar 2023
Behavioral Differences is the Key of Ad-hoc Team Cooperation in
  Multiplayer Games Hanabi
Behavioral Differences is the Key of Ad-hoc Team Cooperation in Multiplayer Games Hanabi
Hyeonchang Jeon
Kyung-Joong Kim
27
0
0
12 Mar 2023
Models of symbol emergence in communication: a conceptual review and a
  guide for avoiding local minima
Models of symbol emergence in communication: a conceptual review and a guide for avoiding local minima
Julian Zubek
Tomasz Korbak
J. Rączaszek-Leonardi
60
3
0
08 Mar 2023
Computational Language Acquisition with Theory of Mind
Computational Language Acquisition with Theory of Mind
Andy Liu
Hao Zhu
Emmy Liu
Yonatan Bisk
Graham Neubig
LLMAGAI4CE
80
18
0
02 Mar 2023
Population-based Evaluation in Repeated Rock-Paper-Scissors as a
  Benchmark for Multiagent Reinforcement Learning
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Marc Lanctot
John Schultz
Neil Burch
Max O. Smith
Daniel Hennes
Thomas W. Anthony
Julien Perolat
OffRL
48
5
0
02 Mar 2023
Improving Zero-Shot Coordination Performance Based on Policy Similarity
Improving Zero-Shot Coordination Performance Based on Policy Similarity
Lebin Yu
Yunbo Qiu
Quanming Yao
Xudong Zhang
Jian Wang
84
1
0
10 Feb 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Yang Li
Shao Zhang
Jichen Sun
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
131
24
0
09 Feb 2023
Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent
  Deep Reinforcement Learning via Multi-Timescale Learning
Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Hadi Nekoei
Akilesh Badrinaaraayanan
Amit Sinha
Mohammad Amini
Janarthanan Rajendran
Aditya Mahajan
Sarath Chandar
64
17
0
06 Feb 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
100
4
0
22 Jan 2023
NOPA: Neurally-guided Online Probabilistic Assistance for Building
  Socially Intelligent Home Assistants
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants
Xavier Puig
Tianmin Shu
J. Tenenbaum
Antonio Torralba
56
22
0
12 Jan 2023
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement
  Learning
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
135
94
0
14 Dec 2022
Credit-cognisant reinforcement learning for multi-agent cooperation
Credit-cognisant reinforcement learning for multi-agent cooperation
F. Bredell
S. M. I. H. A. Engelbrecht
M. I. J. C. Schoeman
25
0
0
18 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
62
0
0
02 Nov 2022
Coordination with Humans via Strategy Matching
Coordination with Humans via Strategy Matching
Michelle Zhao
Reid G. Simmons
H. Admoni
80
10
0
27 Oct 2022
Equivariant Networks for Zero-Shot Coordination
Equivariant Networks for Zero-Shot Coordination
Darius Muglich
Christian Schroeder de Witt
Elise van der Pol
Shimon Whiteson
Jakob N. Foerster
111
14
0
21 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
70
7
0
11 Oct 2022
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning
  Library
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
Siyi Hu
Yifan Zhong
Minquan Gao
Weixun Wang
Hao Dong
Xiaodan Liang
Zhihui Li
Xiaojun Chang
Yaodong Yang
99
17
0
11 Oct 2022
Combining Theory of Mind and Abduction for Cooperation under Imperfect
  Information
Combining Theory of Mind and Abduction for Cooperation under Imperfect Information
Nieves Montes
Nardine Osman
Carles Sierra
68
4
0
30 Sep 2022
Supervised and Reinforcement Learning from Observations in
  Reconnaissance Blind Chess
Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess
T. Bertram
Johannes Furnkranz
Martin Müller
SSLOnRL
100
7
0
03 Aug 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
Mimetic Models: Ethical Implications of AI that Acts Like You
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
77
17
0
19 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
K-level Reasoning for Zero-Shot Coordination in Hanabi
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRLLRM
86
36
0
14 Jul 2022
Self-Explaining Deviations for Coordination
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
53
2
0
13 Jul 2022
Generalized Beliefs for Cooperative AI
Generalized Beliefs for Cooperative AI
Darius Muglich
L. Zintgraf
Christian Schroeder de Witt
Shimon Whiteson
Jakob N. Foerster
90
7
0
26 Jun 2022
Nocturne: a scalable driving benchmark for bringing multi-agent learning
  one step closer to the real world
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world
Eugene Vinitsky
Nathan Lichtlé
Xiaomeng Yang
Brandon Amos
Jakob N. Foerster
OffRL
150
54
0
20 Jun 2022
Policy Optimization for Markov Games: Unified Framework and Faster
  Convergence
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
87
26
0
06 Jun 2022
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent
  RL
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL
Siyi Hu
Chuanlong Xie
Xiaodan Liang
Xiaojun Chang
56
22
0
01 Jun 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy
  Optimization
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
85
30
0
04 Apr 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Jaan Aru
Aqeel Labash
Oriol Corcoll
Raul Vicente
95
27
0
30 Mar 2022
Previous
1234
Next