Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.00506
Cited By
v1
v2 (latest)
The Hanabi Challenge: A New Frontier for AI Research
1 February 2019
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
H. F. Song
Emilio Parisotto
Vincent Dumoulin
Subhodeep Moitra
Edward Hughes
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Hanabi Challenge: A New Frontier for AI Research"
50 / 176 papers shown
Title
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi
Bram Grooten
Jelle Wemmenhove
Maurice Poot
J. Portegies
52
3
0
22 Mar 2022
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects
Xihuai Wang
Zhicheng Zhang
Weinan Zhang
84
25
0
20 Mar 2022
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination
Jaleh Zand
Jack Parker-Holder
Stephen J. Roberts
63
14
0
08 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
125
11
0
01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
71
9
0
23 Feb 2022
Compute Trends Across Three Eras of Machine Learning
J. Sevilla
Lennart Heim
A. Ho
T. Besiroglu
Marius Hobbhahn
Pablo Villalobos
116
279
0
11 Feb 2022
Learning Intuitive Policies Using Action Features
Mingwei Ma
Jizhou Liu
Samuel Sokota
Max Kleiman-Weiner
Jakob N. Foerster
95
4
0
29 Jan 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination
Keane Lucas
R. Allen
103
26
0
28 Jan 2022
Conditional Imitation Learning for Multi-Agent Games
Andy Shih
Stefano Ermon
Dorsa Sadigh
88
11
0
05 Jan 2022
Towards Controllable Agent in MOBA Games with Generative Modeling
Shubao Zhang
61
0
0
15 Dec 2021
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob
David J. Wu
Gabriele Farina
Adam Lerer
Hengyuan Hu
A. Bakhtin
Jacob Andreas
Noam Brown
60
54
0
14 Dec 2021
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
93
22
0
06 Dec 2021
Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates
Nicholas Kantack
33
2
0
18 Nov 2021
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems
Jiayu Chen
Yuanxin Zhang
Yuanfan Xu
Huimin Ma
Huazhong Yang
Jiaming Song
Yu Wang
Yi Wu
VLM
DRL
82
32
0
08 Nov 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Rujikorn Charakorn
P. Manoonpong
Nat Dilokthanakul
85
5
0
05 Nov 2021
Instructive artificial intelligence (AI) for human training, assistance, and explainability
Nicholas Kantack
Nina Cohen
Nathan D. Bos
Corey Lowman
James Everett
Timothy Endres
27
2
0
02 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
116
70
0
15 Oct 2021
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
184
171
0
15 Oct 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning
Arnaud Fickinger
Hengyuan Hu
Brandon Amos
Stuart J. Russell
Noam Brown
97
21
0
30 Sep 2021
Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play
Arkady Arkhangorodsky
Scot Fang
Victoria F. Knight
Ajay Nagesh
Maria Ryskina
Kevin Knight
LLMAG
27
0
0
20 Sep 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi
H. Siu
Jaime D. Peña
Edenna Chen
Yutai Zhou
Victor J. Lopez
Kyle Palko
K. Chang
R. Allen
138
58
0
15 Jul 2021
Centralized Model and Exploration Policy for Multi-Agent RL
Qizhen Zhang
Chris Xiaoxuan Lu
Animesh Garg
Jakob N. Foerster
72
15
0
14 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations
AI Redefined
S. Gottipati
Sagar Kurandwad
Clodéric Mars
Gregory Szriftgiser
Franccois Chabot
67
8
0
21 Jun 2021
Multi-Agent Curricula and Emergent Implicit Signaling
Niko A. Grupen
Daniel D. Lee
B. Selman
113
6
0
21 Jun 2021
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Hengyuan Hu
Adam Lerer
Noam Brown
Jakob N. Foerster
118
20
0
16 Jun 2021
On the Critical Role of Conventions in Adaptive Human-AI Collaboration
Andy Shih
Arjun Sawhney
J. Kondic
Stefano Ermon
Dorsa Sadigh
103
37
0
07 Apr 2021
Esports Agents with a Theory of Mind: Towards Better Engagement, Education, and Engineering
Murtuza N. Shergadwala
M. S. El-Nasr
28
7
0
08 Mar 2021
Off-Belief Learning
Hengyuan Hu
Adam Lerer
Brandon Cui
David J. Wu
Luis Pineda
Noam Brown
Jakob N. Foerster
OffRL
65
73
0
06 Mar 2021
Continuous Coordination As a Realistic Scenario for Lifelong Learning
Hadi Nekoei
Akilesh Badrinaaraayanan
Aaron Courville
Sarath Chandar
CLL
OffRL
84
41
0
04 Mar 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
229
1,294
0
02 Mar 2021
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems
Yaodong Yang
Jun Luo
Ying Wen
Oliver Slumbers
D. Graves
H. Ammar
Jun Wang
Matthew E. Taylor
71
39
0
15 Feb 2021
Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Pol Moreno
Edward Hughes
Kevin R. McKee
Bernardo Avila-Pires
T. Weber
66
23
0
03 Feb 2021
Emergent Communication under Competition
Michael Noukhovitch
Travis LaCroix
Angeliki Lazaridou
Aaron Courville
96
26
0
25 Jan 2021
Theory of Mind for Deep Reinforcement Learning in Hanabi
Andrew Fuchs
Michael Walton
Theresa Chadwick
Doug Lange
80
11
0
22 Jan 2021
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
121
203
0
15 Dec 2020
Applied Machine Learning for Games: A Graduate School Course
Yilei Zeng
Aayush Shah
Jameson Thai
M. Zyda
AI4CE
67
3
0
30 Nov 2020
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
Jack Parker-Holder
Luke Metz
Cinjon Resnick
Hengyuan Hu
Adam Lerer
Alistair Letcher
A. Peysakhovich
Aldo Pacchiano
Jakob N. Foerster
50
24
0
12 Nov 2020
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration
Xavier Puig
Tianmin Shu
Shuang Li
Zilin Wang
Yuan-Hong Liao
J. Tenenbaum
Sanja Fidler
Antonio Torralba
LM&Ro
161
130
0
19 Oct 2020
Prioritized Level Replay
Minqi Jiang
Edward Grefenstette
Tim Rocktaschel
OffRL
128
160
0
08 Oct 2020
Creative Captioning: An AI Grand Challenge Based on the Dixit Board Game
M. Kunda
Irina Rabkina
27
3
0
30 Sep 2020
PettingZoo: Gym for Multi-Agent Reinforcement Learning
J. K. Terry
Benjamin Black
Nathaniel Grammel
Mario Jayakumar
Ananth Hari
...
Caroline Horsch
Clemens Dieffendahl
Niall L. Williams
Yashas Lokesh
Praveen Ravi
OffRL
151
288
0
30 Sep 2020
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
75
9
0
29 Sep 2020
Multiplayer Support for the Arcade Learning Environment
J. K. Terry
Benjamin Black
Luis Santos
74
13
0
20 Sep 2020
The Curse of Shared Knowledge: Recursive Belief Reasoning in a Coordination Game with Imperfect Information
Thomas Bolander
R. Engelhardt
Thomas S. Nicolet
21
1
0
20 Aug 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
103
19
0
14 Aug 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
132
54
0
18 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
107
233
0
14 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
106
43
0
08 Jun 2020
Emergent Multi-Agent Communication in the Deep Learning Era
Angeliki Lazaridou
Marco Baroni
AI4CE
153
206
0
03 Jun 2020
Towards the Role of Theory of Mind in Explanation
Maayan Shvo
Toryn Q. Klassen
Sheila A. McIlraith
66
29
0
06 May 2020
Previous
1
2
3
4
Next