ResearchTrend.AI
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

4 November 2018
Jakob N. Foerster
H. F. Song
Edward Hughes
Neil Burch
Iain Dunning
Shimon Whiteson
M. Botvinick
Michael Bowling

Papers citing "Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning"

50 / 88 papers shown
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
Tobias Gessler
Tin Dizdarevic
Ani Calinescu
Benjamin Ellis
Andrei Lupu
Jakob Foerster
61
1
0
22 Mar 2025
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
250
0
0
17 Mar 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
229
0
0
17 Feb 2025
Configurable Mirror Descent: Towards a Unification of Decision Making
Pengdeng Li
Shuxin Li
Chang Yang
Xinrun Wang
Shuyue Hu
Xiao Huang
Hau Chan
Bo An
36
1
0
20 May 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
34
1
0
15 Feb 2024
The Role of Higher-Order Cognitive Models in Active Learning
Oskar Keurulainen
G. Alcan
Ville Kyrki
46
0
0
09 Jan 2024
Scaling Opponent Shaping to High Dimensional Games
Akbir Khan
Timon Willi
Newton Kwan
Andrea Tacchetti
Chris Xiaoxuan Lu
Edward Grefenstette
Tim Rocktäschel
Jakob N. Foerster
41
10
0
19 Dec 2023
Efficiently Quantifying Individual Agent Importance in Cooperative MARL
Omayma Mahjoub
Ruan de Kock
Siddarth S. Singh
Wiem Khlifi
Abidine Vall
Kale-ab Tessera
Arnu Pretorius
FAtt
40
2
0
13 Dec 2023
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning
Siddarth S. Singh
Omayma Mahjoub
Ruan de Kock
Wiem Khlifi
Abidine Vall
Kale-ab Tessera
Arnu Pretorius
51
1
0
13 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
41
6
0
05 Dec 2023
Learning to Cooperate and Communicate Over Imperfect Channels
Jannis Weil
Gizem Ekinci
Heinz Koeppl
Tobias Meuser
23
0
0
24 Nov 2023
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
Saaket Agashe
Yue Fan
Anthony Reyna
Xin Eric Wang
LLMAG
LRM
102
10
0
05 Oct 2023
Never Explore Repeatedly in Multi-Agent Reinforcement Learning
Chenghao Li
Tonghan Wang
Chongjie Zhang
Qianchuan Zhao
40
2
0
19 Aug 2023
Decentralized Inference via Capability Type Structures in Cooperative Multi-Agent Systems
Charles Jin
Zhang-Wei Hong
Farid Arthaud
Idan Orzech
Martin Rinard
36
0
0
27 Apr 2023
A Novel Point-based Algorithm for Multi-agent Control Using the Common Information Approach
Dengwang Tang
A. Nayyar
R. Jain
16
0
0
10 Apr 2023
Behavioral Differences is the Key of Ad-hoc Team Cooperation in Multiplayer Games Hanabi
Hyeonchang Jeon
Kyung-Joong Kim
23
0
0
12 Mar 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan D'Orazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
27
4
0
22 Jan 2023
Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches
Daniel Fried
Nicholas Tomlin
Jennifer Hu
Roma Patel
Aida Nematzadeh
29
6
0
15 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
27
0
0
02 Nov 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRL
LRM
33
33
0
14 Jul 2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
36
2
0
13 Jul 2022
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games
R. Loftin
F. Oliehoek
4
3
0
20 Jun 2022
A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving
Luca Carminati
Federico Cacciamani
Marco Ciccone
N. Gatti
27
15
0
18 Jun 2022
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Andrew C. Li
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRL
LRM
24
4
0
03 Jun 2022
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Dustin Morrill
Ryan D'Orazio
Marc Lanctot
J. R. Wright
Michael Bowling
Amy Greenwald
51
21
0
24 May 2022
Encouraging Human Interaction with Robot Teams: Legible and Fair Subtask Allocations
Soheil Habibian
Dylan P. Losey
21
12
0
06 May 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Jaan Aru
Aqeel Labash
Oriol Corcoll
Raul Vicente
28
26
0
30 Mar 2022
On the link between conscious function and general intelligence in humans and machines
Arthur Juliani
Kai Arulkumaran
Shuntaro Sasai
Ryota Kanai
42
26
0
24 Mar 2022
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi
Bram Grooten
Jelle Wemmenhove
Maurice Poot
J. Portegies
19
3
0
22 Mar 2022
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination
Jaleh Zand
Jack Parker-Holder
Stephen J. Roberts
22
13
0
08 Mar 2022
The Good Shepherd: An Oracle Agent for Mechanism Design
Jan Balaguer
R. Koster
Christopher Summerfield
Andrea Tacchetti
25
16
0
21 Feb 2022
Adaptive Discrete Communication Bottlenecks with Dynamic Vector Quantization
Dianbo Liu
Alex Lamb
Xu Ji
Pascal Junior Tikeng Notsawo
Michael C. Mozer
Yoshua Bengio
Kenji Kawaguchi
19
14
0
02 Feb 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination
Keane Lucas
R. Allen
56
23
0
28 Jan 2022
Public Information Representation for Adversarial Team Games
Luca Carminati
Federico Cacciamani
Marco Ciccone
N. Gatti
6
9
0
25 Jan 2022
Conditional Imitation Learning for Multi-Agent Games
Andy Shih
Stefano Ermon
Dorsa Sadigh
41
11
0
05 Jan 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravčík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
35
20
0
06 Dec 2021
Adversarial Attacks in Cooperative AI
Ted Fujimoto
Arthur Paul Pedersen
AAML
27
2
0
29 Nov 2021
Distributed Policy Gradient with Variance Reduction in Multi-Agent Reinforcement Learning
Xiaoxiao Zhao
Jinlong Lei
Li Li
Jie-bin Chen
OffRL
20
2
0
25 Nov 2021
Learning to Ground Multi-Agent Communication with Autoencoders
Toru Lin
Minyoung Huh
C. Stauffer
Ser-Nam Lim
Phillip Isola
AI4CE
48
52
0
28 Oct 2021
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning
Shitao Xiao
V. Subramanian
29
9
0
25 Oct 2021
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
124
161
0
15 Oct 2021
Emergence of Theory of Mind Collaboration in Multiagent Systems
Luyao Yuan
Zipeng Fu
Linqi Zhou
Kexin Yang
Song-Chun Zhu
51
10
0
30 Sep 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning
Arnaud Fickinger
Hengyuan Hu
Brandon Amos
Stuart J. Russell
Noam Brown
51
21
0
30 Sep 2021
Temporal Induced Self-Play for Stochastic Bayesian Games
Weizhe Chen
Zihan Zhou
Yi Wu
Fei Fang
13
3
0
21 Aug 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi
H. Siu
Jaime D. Peña
Edenna Chen
Yutai Zhou
Victor J. Lopez
Kyle Palko
K. Chang
R. Allen
27
57
0
15 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations
AI Redefined
S. Gottipati
Sagar Kurandwad
Clodéric Mars
Gregory Szriftgiser
François Chabot
29
8
0
21 Jun 2021
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Hengyuan Hu
Adam Lerer
Noam Brown
Jakob N. Foerster
12
19
0
16 Jun 2021
A New Formalism, Method and Open Issues for Zero-Shot Coordination
Johannes Treutlein
Michael Dennis
Caspar Oesterheld
Jakob N. Foerster
OffRL
29
35
0
11 Jun 2021
Optimal communication and control strategies in a multi-agent MDP problem
Sagar Sudhakara
D. Kartik
Rahul Jain
A. Nayyar
16
3
0
22 Apr 2021
On the Critical Role of Conventions in Adaptive Human-AI Collaboration
Andy Shih
Arjun Sawhney
J. Kondic
Stefano Ermon
Dorsa Sadigh
44
37
0
07 Apr 2021