"Other-Play" for Zero-Shot Coordination

6 March 2020

Papers citing "Other-Play" for Zero-Shot Coordination

50 / 146 papers shown

Title
Learning to Coordinate with Anyone Lei Yuan Lihe Li Ziqian Zhang F. Chen Tianyi Zhang Cong Guan Yang Yu Zhi Zhou LLMAG 107 5 0 22 Sep 2023
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi Hadi Nekoei Xutong Zhao Janarthanan Rajendran Miao Liu Sarath Chandar 52 5 0 20 Aug 2023
Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents Arrasy Rahman Jiaxun Cui Peter Stone 82 13 0 18 Aug 2023
ESP: Exploiting Symmetry Prior for Multi-Agent Reinforcement Learning Xin Yu Rongye Shi Pu Feng Yongkai Tian Jie Luo Wenjun Wu 64 7 0 30 Jul 2023
Learning Multi-Agent Communication with Contrastive Learning Y. Lo B. Sengupta Jakob N. Foerster Michael Noukhovitch 91 5 0 03 Jul 2023
Who Needs to Know? Minimal Knowledge for Optimal Coordination Niklas Lauffer Ameesh Shah Micah Carroll Michael Dennis Stuart J. Russell 59 6 0 15 Jun 2023
How to Evaluate Behavioral Models G. d'Eon Sophie Greenwood Kevin Leyton-Brown J. R. Wright 92 0 0 07 Jun 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination Yang Li Shao Zhang Jichen Sun Wenhao Zhang Yali Du Ying Wen Xinbing Wang Wei Pan 105 17 0 05 Jun 2023
EMOTE: An Explainable architecture for Modelling the Other Through Empathy M. Senadeera Thommen Karimpanal George Sunil R. Gupta Stephan Jacobs Santu Rana 50 1 0 01 Jun 2023
Adaptive Coordination in Social Embodied Rearrangement Andrew Szot Unnat Jain Dhruv Batra Z. Kira Ruta Desai Akshara Rai 78 14 0 31 May 2023
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem Paul Barde Jakob N. Foerster Derek Nowrouzezahrai Amy Zhang OffRL 73 12 0 26 May 2023
A Hierarchical Approach to Population Training for Human-AI Collaboration Yi Loo Chen Gong Malika Meghjani 60 8 0 26 May 2023
Fast Teammate Adaptation in the Presence of Sudden Policy Change Ziqian Zhang Lei Yuan Lihe Li Ke Xue Chengxing Jia Cong Guan Chao Qian Yang Yu 89 9 0 10 May 2023
Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers Lei Yuan Zifei Zhang Ke Xue Hao Yin F. Chen Cong Guan Lihe Li Chao Qian Yang Yu AAML 88 18 0 10 May 2023
Multi-agent Continual Coordination via Progressive Task Contextualization Lei Yuan Lihe Li Ziqian Zhang Fuxiang Zhang Cong Guan Yang Yu CLL 83 8 0 07 May 2023
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas Udari Madhushani Kevin R. McKee J. Agapiou Joel Z Leibo Richard Everett Thomas W. Anthony Edward Hughes K. Tuyls Edgar A. Duénez-Guzmán 73 3 0 01 May 2023
Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective Yiming Gao Feiyu Liu Liang Wang Zhenjie Lian Weixuan Wang ... Jiawei Wang Qiang Fu Wei Yang Lanxiao Huang Wei Liu 66 7 0 23 Apr 2023
Language Instructed Reinforcement Learning for Human-AI Coordination Hengyuan Hu Dorsa Sadigh LM&Ro 96 64 0 13 Apr 2023
Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning Claude Formanek C. Tilbury Jonathan P. Shock Kale-ab Tessera Arnu Pretorius 74 3 0 31 Mar 2023
Towards the Scalable Evaluation of Cooperativeness in Language Models Alan Chan Maxime Riché Jesse Clifton LLMAG 84 7 0 16 Mar 2023
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning Marc Lanctot John Schultz Neil Burch Max O. Smith Daniel Hennes Thomas W. Anthony Julien Perolat OffRL 48 5 0 02 Mar 2023
Improving Zero-Shot Coordination Performance Based on Policy Similarity Lebin Yu Yunbo Qiu Quanming Yao Xudong Zhang Jian Wang 84 1 0 10 Feb 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination Yang Li Shao Zhang Jichen Sun Yali Du Ying Wen Xinbing Wang Wei Pan 135 24 0 09 Feb 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased Chao Yu Jiaxuan Gao Weiling Liu Bo Xu Hao Tang Jiaqi Yang Yu Wang Yi Wu 109 42 0 03 Feb 2023
Human-Timescale Adaptation in an Open-Ended Task Space Adaptive Agent Team Jakob Bauer Kate Baumli Satinder Baveja Feryal M. P. Behbahani ... Jakub Sygnowski K. Tuyls Sarah York Alexander Zacherl Lei Zhang LM&Ro OffRL AI4CE LRM 139 119 0 18 Jan 2023
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination Xingzhou Lou Jiaxian Guo Junge Zhang Jun Wang Kaiqi Huang Yali Du 76 29 0 16 Jan 2023
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants Xavier Puig Tianmin Shu J. Tenenbaum Antonio Torralba 58 22 0 12 Jan 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration Chao Yu Xinyi Yang Jiaxuan Gao Jiayu Chen Yunfei Li ... Yunfei Xiang Rui Huang Huazhong Yang Yi Wu Yu Wang 70 38 0 09 Jan 2023
Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning Aviv Netanyahu Tianmin Shu J. Tenenbaum Pulkit Agrawal 49 5 0 24 Nov 2022
Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration Mesut Yang Micah Carroll Anca Dragan 100 14 0 03 Nov 2022
Coordination with Humans via Strategy Matching Michelle Zhao Reid G. Simmons H. Admoni 80 10 0 27 Oct 2022
Equivariant Networks for Zero-Shot Coordination Darius Muglich Christian Schroeder de Witt Elise van der Pol Shimon Whiteson Jakob N. Foerster 111 14 0 21 Oct 2022
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning A. Bakhtin David J. Wu Adam Lerer Jonathan Gray Athul Paul Jacob Gabriele Farina Alexander H. Miller Noam Brown 122 47 0 11 Oct 2022
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning Arrasy Rahman Ignacio Carlucho Niklas Höpner Stefano V. Albrecht 114 11 0 11 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning Hengyuan Hu David J. Wu Adam Lerer Jakob N. Foerster Noam Brown 70 7 0 11 Oct 2022
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward Zixian Ma Rose E. Wang Li Fei-Fei Michael S. Bernstein Ranjay Krishna 73 17 0 09 Oct 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans John J. Nay ELM AILaw 190 29 0 14 Sep 2022
Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity Arrasy Rahman Elliot Fosong Ignacio Carlucho Stefano V. Albrecht 103 10 0 28 Jul 2022
Meta-Referential Games to Learn Compositional Learning Behaviours Kevin Denamganai S. Missaoui James Alfred Walker 66 1 0 16 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi Brandon Cui Hengyuan Hu Luis Pineda Jakob N. Foerster OffRL LRM 86 36 0 14 Jul 2022
Self-Explaining Deviations for Coordination Hengyuan Hu Samuel Sokota David J. Wu A. Bakhtin Andrei Lupu Brandon Cui Jakob N. Foerster 53 2 0 13 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design Minqi Jiang Michael Dennis Jack Parker-Holder Andrei Lupu Heinrich Küttler Edward Grefenstette Tim Rocktäschel Jakob N. Foerster 100 15 0 11 Jul 2022
For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria Scott Emmons Caspar Oesterheld Andrew Critch Vincent Conitzer Stuart J. Russell 53 10 0 07 Jul 2022
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning Lukas Schäfer Filippos Christianos Amos Storkey Stefano V. Albrecht 51 7 0 05 Jul 2022
Generalized Beliefs for Cooperative AI Darius Muglich L. Zintgraf Christian Schroeder de Witt Shimon Whiteson Jakob N. Foerster 90 7 0 26 Jun 2022
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games R. Loftin F. Oliehoek 24 4 0 20 Jun 2022
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world Eugene Vinitsky Nathan Lichtlé Xiaomeng Yang Brandon Amos Jakob N. Foerster OffRL 150 54 0 20 Jun 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning Wei Fu Chao Yu Zelai Xu Jiaqi Yang Yi Wu 100 35 0 15 Jun 2022
The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models Cassidy Laidlaw Anca Dragan OffRL 72 39 0 22 Apr 2022
MA-Dreamer: Coordination and communication through shared imagination Kenzo Lobos-Tsunekawa Akshay Srinivasan Michael Spranger 62 2 0 10 Apr 2022