ResearchTrend.AI
"Other-Play" for Zero-Shot Coordination

6 March 2020
Hengyuan Hu
Adam Lerer
A. Peysakhovich
Jakob N. Foerster
VLM, OffRL

Papers citing "Other-Play" for Zero-Shot Coordination

50 / 146 papers shown
Learning to Coordinate with Anyone
Lei Yuan
Lihe Li
Ziqian Zhang
F. Chen
Tianyi Zhang
Cong Guan
Yang Yu
Zhi Zhou
LLMAG
107
5
0
22 Sep 2023
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Hadi Nekoei
Xutong Zhao
Janarthanan Rajendran
Miao Liu
Sarath Chandar
52
5
0
20 Aug 2023
Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents
Arrasy Rahman
Jiaxun Cui
Peter Stone
82
13
0
18 Aug 2023
ESP: Exploiting Symmetry Prior for Multi-Agent Reinforcement Learning
Xin Yu
Rongye Shi
Pu Feng
Yongkai Tian
Jie Luo
Wenjun Wu
64
7
0
30 Jul 2023
Learning Multi-Agent Communication with Contrastive Learning
Y. Lo
B. Sengupta
Jakob N. Foerster
Michael Noukhovitch
91
5
0
03 Jul 2023
Who Needs to Know? Minimal Knowledge for Optimal Coordination
Niklas Lauffer
Ameesh Shah
Micah Carroll
Michael Dennis
Stuart J. Russell
59
6
0
15 Jun 2023
How to Evaluate Behavioral Models
G. d'Eon
Sophie Greenwood
Kevin Leyton-Brown
J. R. Wright
92
0
0
07 Jun 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Yang Li
Shao Zhang
Jichen Sun
Wenhao Zhang
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
105
17
0
05 Jun 2023
EMOTE: An Explainable architecture for Modelling the Other Through Empathy
M. Senadeera
Thommen Karimpanal George
Sunil R. Gupta
Stephan Jacobs
Santu Rana
50
1
0
01 Jun 2023
Adaptive Coordination in Social Embodied Rearrangement
Andrew Szot
Unnat Jain
Dhruv Batra
Z. Kira
Ruta Desai
Akshara Rai
78
14
0
31 May 2023
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem
Paul Barde
Jakob N. Foerster
Derek Nowrouzezahrai
Amy Zhang
OffRL
73
12
0
26 May 2023
A Hierarchical Approach to Population Training for Human-AI Collaboration
Yi Loo
Chen Gong
Malika Meghjani
60
8
0
26 May 2023
Fast Teammate Adaptation in the Presence of Sudden Policy Change
Ziqian Zhang
Lei Yuan
Lihe Li
Ke Xue
Chengxing Jia
Cong Guan
Chao Qian
Yang Yu
89
9
0
10 May 2023
Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers
Lei Yuan
Zifei Zhang
Ke Xue
Hao Yin
F. Chen
Cong Guan
Lihe Li
Chao Qian
Yang Yu
AAML
88
18
0
10 May 2023
Multi-agent Continual Coordination via Progressive Task Contextualization
Lei Yuan
Lihe Li
Ziqian Zhang
Fuxiang Zhang
Cong Guan
Yang Yu
CLL
83
8
0
07 May 2023
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Udari Madhushani
Kevin R. McKee
J. Agapiou
Joel Z Leibo
Richard Everett
Thomas W. Anthony
Edward Hughes
K. Tuyls
Edgar A. Duéñez-Guzmán
73
3
0
01 May 2023
Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Weixuan Wang
...
Jiawei Wang
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
66
7
0
23 Apr 2023
Language Instructed Reinforcement Learning for Human-AI Coordination
Hengyuan Hu
Dorsa Sadigh
LM&Ro
96
64
0
13 Apr 2023
Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning
Claude Formanek
C. Tilbury
Jonathan P. Shock
Kale-ab Tessera
Arnu Pretorius
74
3
0
31 Mar 2023
Towards the Scalable Evaluation of Cooperativeness in Language Models
Alan Chan
Maxime Riché
Jesse Clifton
LLMAG
84
7
0
16 Mar 2023
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Marc Lanctot
John Schultz
Neil Burch
Max O. Smith
Daniel Hennes
Thomas W. Anthony
Julien Perolat
OffRL
48
5
0
02 Mar 2023
Improving Zero-Shot Coordination Performance Based on Policy Similarity
Lebin Yu
Yunbo Qiu
Quanming Yao
Xudong Zhang
Jian Wang
84
1
0
10 Feb 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Yang Li
Shao Zhang
Jichen Sun
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
135
24
0
09 Feb 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Chao Yu
Jiaxuan Gao
Weiling Liu
Bo Xu
Hao Tang
Jiaqi Yang
Yu Wang
Yi Wu
109
42
0
03 Feb 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro, OffRL, AI4CE, LRM
139
119
0
18 Jan 2023
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination
Xingzhou Lou
Jiaxian Guo
Junge Zhang
Jun Wang
Kaiqi Huang
Yali Du
76
29
0
16 Jan 2023
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants
Xavier Puig
Tianmin Shu
J. Tenenbaum
Antonio Torralba
58
22
0
12 Jan 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
70
38
0
09 Jan 2023
Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning
Aviv Netanyahu
Tianmin Shu
J. Tenenbaum
Pulkit Agrawal
49
5
0
24 Nov 2022
Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration
Mesut Yang
Micah Carroll
Anca Dragan
100
14
0
03 Nov 2022
Coordination with Humans via Strategy Matching
Michelle Zhao
Reid G. Simmons
H. Admoni
80
10
0
27 Oct 2022
Equivariant Networks for Zero-Shot Coordination
Darius Muglich
Christian Schroeder de Witt
Elise van der Pol
Shimon Whiteson
Jakob N. Foerster
111
14
0
21 Oct 2022
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
A. Bakhtin
David J. Wu
Adam Lerer
Jonathan Gray
Athul Paul Jacob
Gabriele Farina
Alexander H. Miller
Noam Brown
122
47
0
11 Oct 2022
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Arrasy Rahman
Ignacio Carlucho
Niklas Höpner
Stefano V. Albrecht
114
11
0
11 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
70
7
0
11 Oct 2022
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward
Zixian Ma
Rose E. Wang
Li Fei-Fei
Michael S. Bernstein
Ranjay Krishna
73
17
0
09 Oct 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELM, AILaw
190
29
0
14 Sep 2022
Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity
Arrasy Rahman
Elliot Fosong
Ignacio Carlucho
Stefano V. Albrecht
103
10
0
28 Jul 2022
Meta-Referential Games to Learn Compositional Learning Behaviours
Kevin Denamganai
S. Missaoui
James Alfred Walker
66
1
0
16 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRL, LRM
86
36
0
14 Jul 2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
53
2
0
13 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktäschel
Jakob N. Foerster
100
15
0
11 Jul 2022
For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria
Scott Emmons
Caspar Oesterheld
Andrew Critch
Vincent Conitzer
Stuart J. Russell
53
10
0
07 Jul 2022
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
Lukas Schäfer
Filippos Christianos
Amos Storkey
Stefano V. Albrecht
51
7
0
05 Jul 2022
Generalized Beliefs for Cooperative AI
Darius Muglich
L. Zintgraf
Christian Schroeder de Witt
Shimon Whiteson
Jakob N. Foerster
90
7
0
26 Jun 2022
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games
R. Loftin
F. Oliehoek
24
4
0
20 Jun 2022
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world
Eugene Vinitsky
Nathan Lichtlé
Xiaomeng Yang
Brandon Amos
Jakob N. Foerster
OffRL
150
54
0
20 Jun 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Wei Fu
Chao Yu
Zelai Xu
Jiaqi Yang
Yi Wu
100
35
0
15 Jun 2022
The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models
Cassidy Laidlaw
Anca Dragan
OffRL
72
39
0
22 Apr 2022
MA-Dreamer: Coordination and communication through shared imagination
Kenzo Lobos-Tsunekawa
Akshay Srinivasan
Michael Spranger
62
2
0
10 Apr 2022