Title
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi Bram Grooten Jelle Wemmenhove Maurice Poot J. Portegies 52 3 0 22 Mar 2022
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects Xihuai Wang Zhicheng Zhang Weinan Zhang 84 25 0 20 Mar 2022
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination Jaleh Zand Jack Parker-Holder Stephen J. Roberts 63 14 0 08 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data Cultural General Intelligence Team Avishkar Bhoopchand Bethanie Brownfield Adrian Collister Agustin Dal Lago ... Alex Platonov Evan Senter Sukhdeep Singh Alexander Zacherl Lei M. Zhang VLM 125 11 0 01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 71 9 0 23 Feb 2022
Compute Trends Across Three Eras of Machine Learning J. Sevilla Lennart Heim A. Ho T. Besiroglu Marius Hobbhahn Pablo Villalobos 116 279 0 11 Feb 2022
Learning Intuitive Policies Using Action Features Mingwei Ma Jizhou Liu Samuel Sokota Max Kleiman-Weiner Jakob N. Foerster 95 4 0 29 Jan 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination Keane Lucas R. Allen 103 26 0 28 Jan 2022
Conditional Imitation Learning for Multi-Agent Games Andy Shih Stefano Ermon Dorsa Sadigh 88 11 0 05 Jan 2022
Towards Controllable Agent in MOBA Games with Generative Modeling Shubao Zhang 61 0 0 15 Dec 2021
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob David J. Wu Gabriele Farina Adam Lerer Hengyuan Hu A. Bakhtin Jacob Andreas Noam Brown 60 54 0 14 Dec 2021
Student of Games: A unified learning algorithm for both perfect and imperfect information games Martin Schmid Matej Moravčík Neil Burch Rudolf Kadlec Josh Davidson ... Marc Lanctot G. Z. Holland Elnaz Davoodi Alden Christianson Michael Bowling 93 22 0 06 Dec 2021
Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates Nicholas Kantack 33 2 0 18 Nov 2021
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems Jiayu Chen Yuanxin Zhang Yuanfan Xu Huimin Ma Huazhong Yang Jiaming Song Yu Wang Yi Wu VLM DRL 82 32 0 08 Nov 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning Rujikorn Charakorn P. Manoonpong Nat Dilokthanakul 85 5 0 05 Nov 2021
Instructive artificial intelligence (AI) for human training, assistance, and explainability Nicholas Kantack Nina Cohen Nathan D. Bos Corey Lowman James Everett Timothy Endres 27 2 0 02 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind Yuan-Fang Wang Fangwei Zhong Jing Xu Yizhou Wang LLMAG 116 70 0 15 Oct 2021
Collaborating with Humans without Human Data D. Strouse Kevin R. McKee M. Botvinick Edward Hughes Richard Everett 184 171 0 15 Oct 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning Arnaud Fickinger Hengyuan Hu Brandon Amos Stuart J. Russell Noam Brown 97 21 0 30 Sep 2021
Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play Arkady Arkhangorodsky Scot Fang Victoria F. Knight Ajay Nagesh Maria Ryskina Kevin Knight LLMAG 27 0 0 20 Sep 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi H. Siu Jaime D. Peña Edenna Chen Yutai Zhou Victor J. Lopez Kyle Palko K. Chang R. Allen 138 58 0 15 Jul 2021
Centralized Model and Exploration Policy for Multi-Agent RL Qizhen Zhang Chris Xiaoxuan Lu Animesh Garg Jakob N. Foerster 72 15 0 14 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations AI Redefined S. Gottipati Sagar Kurandwad Clodéric Mars Gregory Szriftgiser François Chabot 67 8 0 21 Jun 2021
Multi-Agent Curricula and Emergent Implicit Signaling Niko A. Grupen Daniel D. Lee B. Selman 113 6 0 21 Jun 2021
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings Hengyuan Hu Adam Lerer Noam Brown Jakob N. Foerster 118 20 0 16 Jun 2021
On the Critical Role of Conventions in Adaptive Human-AI Collaboration Andy Shih Arjun Sawhney J. Kondic Stefano Ermon Dorsa Sadigh 103 37 0 07 Apr 2021
Esports Agents with a Theory of Mind: Towards Better Engagement, Education, and Engineering Murtuza N. Shergadwala M. S. El-Nasr 28 7 0 08 Mar 2021
Off-Belief Learning Hengyuan Hu Adam Lerer Brandon Cui David J. Wu Luis Pineda Noam Brown Jakob N. Foerster OffRL 65 73 0 06 Mar 2021
Continuous Coordination As a Realistic Scenario for Lifelong Learning Hadi Nekoei Akilesh Badrinaaraayanan Aaron Courville Sarath Chandar CLL OffRL 84 41 0 04 Mar 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games Chao Yu Akash Velu Eugene Vinitsky Jiaxuan Gao Yu Wang Alexandre M. Bayen Yi Wu OffRL 229 1,294 0 02 Mar 2021
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems Yaodong Yang Jun Luo Ying Wen Oliver Slumbers D. Graves H. Ammar Jun Wang Matthew E. Taylor 71 39 0 15 Feb 2021
Neural Recursive Belief States in Multi-Agent Reinforcement Learning Pol Moreno Edward Hughes Kevin R. McKee Bernardo Avila-Pires T. Weber 66 23 0 03 Feb 2021
Emergent Communication under Competition Michael Noukhovitch Travis LaCroix Angeliki Lazaridou Aaron Courville 96 26 0 25 Jan 2021
Theory of Mind for Deep Reinforcement Learning in Hanabi Andrew Fuchs Michael Walton Theresa Chadwick Doug Lange 80 11 0 22 Jan 2021
Open Problems in Cooperative AI Allan Dafoe Edward Hughes Yoram Bachrach Tantum Collins Kevin R. McKee Joel Z Leibo Kate Larson T. Graepel 121 203 0 15 Dec 2020
Applied Machine Learning for Games: A Graduate School Course Yilei Zeng Aayush Shah Jameson Thai M. Zyda AI4CE 67 3 0 30 Nov 2020
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian Jack Parker-Holder Luke Metz Cinjon Resnick Hengyuan Hu Adam Lerer Alistair Letcher A. Peysakhovich Aldo Pacchiano Jakob N. Foerster 50 24 0 12 Nov 2020
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration Xavier Puig Tianmin Shu Shuang Li Zilin Wang Yuan-Hong Liao J. Tenenbaum Sanja Fidler Antonio Torralba LM&Ro 161 130 0 19 Oct 2020
Prioritized Level Replay Minqi Jiang Edward Grefenstette Tim Rocktäschel OffRL 128 160 0 08 Oct 2020
Creative Captioning: An AI Grand Challenge Based on the Dixit Board Game M. Kunda Irina Rabkina 27 3 0 30 Sep 2020
PettingZoo: Gym for Multi-Agent Reinforcement Learning J. K. Terry Benjamin Black Nathaniel Grammel Mario Jayakumar Ananth Hari ... Caroline Horsch Clemens Dieffendahl Niall L. Williams Yashas Lokesh Praveen Ravi OffRL 151 288 0 30 Sep 2020
Learning to Play against Any Mixture of Opponents Max O. Smith Thomas W. Anthony Yongzhao Wang Michael P. Wellman OffRL 75 9 0 29 Sep 2020
Multiplayer Support for the Arcade Learning Environment J. K. Terry Benjamin Black Luis Santos 74 13 0 20 Sep 2020
The Curse of Shared Knowledge: Recursive Belief Reasoning in a Coordination Game with Imperfect Information Thomas Bolander R. Engelhardt Thomas S. Nicolet 21 1 0 20 Aug 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information Yuandong Tian Qucheng Gong Tina Jiang 103 19 0 14 Aug 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning Eric Steinberger Adam Lerer Noam Brown 132 54 0 18 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks Georgios Papoudakis Filippos Christianos Lukas Schäfer Stefano V. Albrecht OffRL 107 233 0 14 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration Thomas W. Anthony Tom Eccles Andrea Tacchetti János Kramár I. Gemp ... Richard Everett Roman Werpachowski Satinder Singh T. Graepel Yoram Bachrach 106 43 0 08 Jun 2020
Emergent Multi-Agent Communication in the Deep Learning Era Angeliki Lazaridou Marco Baroni AI4CE 153 206 0 03 Jun 2020
Towards the Role of Theory of Mind in Explanation Maayan Shvo Toryn Q. Klassen Sheila A. McIlraith 66 29 0 06 May 2020