Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.02318
Cited By
Improving Policies via Search in Cooperative Partially Observable Games
5 December 2019
Adam Lerer
Hengyuan Hu
Jakob N. Foerster
Noam Brown
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving Policies via Search in Cooperative Partially Observable Games"
23 / 23 papers shown
Title
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
47
2
0
28 Oct 2024
Adaptation Procedure in Misinformation Games
Konstantinos Varsos
Merkouris Papamichail
G. Flouris
M. Bitsaki
27
0
0
07 Sep 2024
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
27
4
0
22 Jan 2023
Safe Subgame Resolving for Extensive Form Correlated Equilibrium
Chun Kai Ling
Fei Fang
15
0
0
29 Dec 2022
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
22
2
0
28 Dec 2022
What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?
Songyang Han
Sanbao Su
Sihong He
Shuo Han
Haizhao Yang
Shaofeng Zou
Fei Miao
AAML
33
23
0
06 Dec 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
19
7
0
11 Oct 2022
Combining Theory of Mind and Abduction for Cooperation under Imperfect Information
Nieves Montes
Nardine Osman
Carles Sierra
26
4
0
30 Sep 2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
30
2
0
13 Jul 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
27
26
0
30 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning
Shitao Xiao
V. Subramanian
29
9
0
25 Oct 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi
H. Siu
Jaime D. Peña
Edenna Chen
Yutai Zhou
Victor J. Lopez
Kyle Palko
K. Chang
R. Allen
19
57
0
15 Jul 2021
Communicating Natural Programs to Humans and Machines
Samuel Acquaviva
Yewen Pu
Marta Kryven
Theo Sechopoulos
Catherine Wong
Gabrielle Ecanow
Maxwell Nye
Michael Henry Tessler
J. Tenenbaum
33
40
0
15 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
16
117
0
11 Jun 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan DÓrazio
Neil Burch
Martin Schmid
Michael Bowling
Marc Lanctot
42
22
0
11 Jan 2021
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
37
199
0
15 Dec 2020
Human-Agent Cooperation in Bridge Bidding
Edward Lockhart
Neil Burch
Nolan Bard
Sebastian Borgeaud
Tom Eccles
Lucas Smaira
Ray Smith
14
12
0
28 Nov 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
31
19
0
14 Aug 2020
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Noam Brown
A. Bakhtin
Adam Lerer
Qucheng Gong
20
133
0
27 Jul 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
19
42
0
08 Jun 2020
Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners
Rodrigo Canaan
Xianbo Gao
Youjin Chung
Julian Togelius
Andy Nealen
Stefan Menzel
19
4
0
28 Apr 2020
Rethinking Formal Models of Partially Observable Multiagent Decision Making
Vojtěch Kovařík
Martin Schmid
Neil Burch
Michael Bowling
Viliam Lisý
OffRL
19
54
0
26 Jun 2019
1