Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

4 November 2018

Papers citing "Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning"

50 / 88 papers shown

Title
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination Tobias Gessler Tin Dizdarevic Ani Calinescu Benjamin Ellis Andrei Lupu Jakob Foerster 61 1 0 22 Mar 2025
A Generalist Hanabi Agent Arjun Vaithilingam Sudhakar Hadi Nekoei Mathieu Reymond Miao Liu Janarthanan Rajendran Sarath Chandar 250 0 0 17 Mar 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems Marcos Negre Saura Richard Allmendinger Theodore Papamarkou Wei Pan 229 0 0 17 Feb 2025
Configurable Mirror Descent: Towards a Unification of Decision Making Pengdeng Li Shuxin Li Chang Yang Xinrun Wang Shuyue Hu Xiao Huang Hau Chan Bo An 36 1 0 20 May 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork Ravi Hammond Dustin Craggs Mingyu Guo Jakob Foerster Ian Reid 34 1 0 15 Feb 2024
The Role of Higher-Order Cognitive Models in Active Learning Oskar Keurulainen G. Alcan Ville Kyrki 46 0 0 09 Jan 2024
Scaling Opponent Shaping to High Dimensional Games Akbir Khan Timon Willi Newton Kwan Andrea Tacchetti Chris Xiaoxuan Lu Edward Grefenstette Tim Rocktaschel Jakob N. Foerster 41 10 0 19 Dec 2023
Efficiently Quantifying Individual Agent Importance in Cooperative MARL Omayma Mahjoub Ruan de Kock Siddarth S. Singh Wiem Khlifi Abidine Vall Kale-ab Tessera Arnu Pretorius FAtt 40 2 0 13 Dec 2023
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning Siddarth S. Singh Omayma Mahjoub Ruan de Kock Wiem Khlifi Abidine Vall Kale-ab Tessera Arnu Pretorius 51 1 0 13 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning Youpeng Zhao Yudong Lu Jian Zhao Wen-gang Zhou Houqiang Li 41 6 0 05 Dec 2023
Learning to Cooperate and Communicate Over Imperfect Channels Jannis Weil Gizem Ekinci Heinz Koeppl Tobias Meuser 23 0 0 24 Nov 2023
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models Saaket Agashe Yue Fan Anthony Reyna Xin Eric Wang LLMAG LRM 102 10 0 05 Oct 2023
Never Explore Repeatedly in Multi-Agent Reinforcement Learning Chenghao Li Tonghan Wang Chongjie Zhang Qianchuan Zhao 40 2 0 19 Aug 2023
Decentralized Inference via Capability Type Structures in Cooperative Multi-Agent Systems Charles Jin Zhang-Wei Hong Farid Arthaud Idan Orzech Martin Rinard 36 0 0 27 Apr 2023
A Novel Point-based Algorithm for Multi-agent Control Using the Common Information Approach Dengwang Tang A. Nayyar R. Jain 16 0 0 10 Apr 2023
Behavioral Differences is the Key of Ad-hoc Team Cooperation in Multiplayer Games Hanabi Hyeonchang Jeon Kyung-Joong Kim 23 0 0 12 Mar 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games Samuel Sokota Ryan DÓrazio Chun Kai Ling David J. Wu J. Zico Kolter Noam Brown 27 4 0 22 Jan 2023
Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches Daniel Fried Nicholas Tomlin Jennifer Hu Roma Patel Aida Nematzadeh 29 6 0 15 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning Peng Zhang Yawen Huang Bingzhang Hu Shizheng Wang Haoran Duan Noura Al Moubayed Yefeng Zheng Yang Long OffRL 27 0 0 02 Nov 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi Brandon Cui Hengyuan Hu Luis Pineda Jakob N. Foerster OffRL LRM 33 33 0 14 Jul 2022
Self-Explaining Deviations for Coordination Hengyuan Hu Samuel Sokota David J. Wu A. Bakhtin Andrei Lupu Brandon Cui Jakob N. Foerster 36 2 0 13 Jul 2022
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games R. Loftin F. Oliehoek 4 3 0 20 Jun 2022
A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving Luca Carminati Federico Cacciamani Marco Ciccone N. Gatti 27 15 0 18 Jun 2022
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks Andrew C. Li Pashootan Vaezipoor Rodrigo Toro Icarte Sheila A. McIlraith OffRL LRM 24 4 0 03 Jun 2022
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections Dustin Morrill Ryan DÓrazio Marc Lanctot J. R. Wright Michael Bowling Amy Greenwald 51 21 0 24 May 2022
Encouraging Human Interaction with Robot Teams: Legible and Fair Subtask Allocations Soheil Habibian Dylan P. Losey 21 12 0 06 May 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind Jaan Aru Aqeel Labash Oriol Corcoll Raul Vicente 28 26 0 30 Mar 2022
On the link between conscious function and general intelligence in humans and machines Arthur Juliani Kai Arulkumaran Shuntaro Sasai Ryota Kanai 42 26 0 24 Mar 2022
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi Bram Grooten Jelle Wemmenhove Maurice Poot J. Portegies 19 3 0 22 Mar 2022
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination Jaleh Zand Jack Parker-Holder Stephen J. Roberts 22 13 0 08 Mar 2022
The Good Shepherd: An Oracle Agent for Mechanism Design Jan Balaguer R. Koster Christopher Summerfield Andrea Tacchetti 25 16 0 21 Feb 2022
Adaptive Discrete Communication Bottlenecks with Dynamic Vector Quantization Dianbo Liu Alex Lamb Xu Ji Pascal Junior Tikeng Notsawo Michael C. Mozer Yoshua Bengio Kenji Kawaguchi 19 14 0 02 Feb 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination Keane Lucas R. Allen 56 23 0 28 Jan 2022
Public Information Representation for Adversarial Team Games Luca Carminati Federico Cacciamani Marco Ciccone N. Gatti 6 9 0 25 Jan 2022
Conditional Imitation Learning for Multi-Agent Games Andy Shih Stefano Ermon Dorsa Sadigh 41 11 0 05 Jan 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games Martin Schmid Matej Moravcík Neil Burch Rudolf Kadlec Josh Davidson ... Marc Lanctot G. Z. Holland Elnaz Davoodi Alden Christianson Michael Bowling 35 20 0 06 Dec 2021
Adversarial Attacks in Cooperative AI Ted Fujimoto Arthur Paul Pedersen AAML 27 2 0 29 Nov 2021
Distributed Policy Gradient with Variance Reduction in Multi-Agent Reinforcement Learning Xiaoxiao Zhao Jinlong Lei Li Li Jie-bin Chen OffRL 20 2 0 25 Nov 2021
Learning to Ground Multi-Agent Communication with Autoencoders Toru Lin Minyoung Huh C. Stauffer Ser-Nam Lim Phillip Isola AI4CE 48 52 0 28 Oct 2021
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning Shitao Xiao V. Subramanian 29 9 0 25 Oct 2021
Collaborating with Humans without Human Data D. Strouse Kevin R. McKee M. Botvinick Edward Hughes Richard Everett 124 161 0 15 Oct 2021
Emergence of Theory of Mind Collaboration in Multiagent Systems Luyao Yuan Zipeng Fu Linqi Zhou Kexin Yang Song-Chun Zhu 51 10 0 30 Sep 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning Arnaud Fickinger Hengyuan Hu Brandon Amos Stuart J. Russell Noam Brown 51 21 0 30 Sep 2021
Temporal Induced Self-Play for Stochastic Bayesian Games Weizhe Chen Zihan Zhou Yi Wu Fei Fang 13 3 0 21 Aug 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi H. Siu Jaime D. Peña Edenna Chen Yutai Zhou Victor J. Lopez Kyle Palko K. Chang R. Allen 27 57 0 15 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations AI Redefined S. Gottipati Sagar Kurandwad Clodéric Mars Gregory Szriftgiser Franccois Chabot 29 8 0 21 Jun 2021
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings Hengyuan Hu Adam Lerer Noam Brown Jakob N. Foerster 12 19 0 16 Jun 2021
A New Formalism, Method and Open Issues for Zero-Shot Coordination Johannes Treutlein Michael Dennis Caspar Oesterheld Jakob N. Foerster OffRL 29 35 0 11 Jun 2021
Optimal communication and control strategies in a multi-agent MDP problem Sagar Sudhakara D. Kartik Rahul Jain A. Nayyar 16 3 0 22 Apr 2021
On the Critical Role of Conventions in Adaptive Human-AI Collaboration Andy Shih Arjun Sawhney J. Kondic Stefano Ermon Dorsa Sadigh 44 37 0 07 Apr 2021