On-the-fly Strategy Adaptation for ad-hoc Agent Coordination

8 March 2022

Papers citing "On-the-fly Strategy Adaptation for ad-hoc Agent Coordination"

34 / 34 papers shown

Title
Collaborating with Humans without Human Data D. Strouse Kevin R. McKee M. Botvinick Edward Hughes Richard Everett 154 167 0 15 Oct 2021
Off-Belief Learning Hengyuan Hu Adam Lerer Brandon Cui David J. Wu Luis Pineda Noam Brown Jakob N. Foerster OffRL 36 71 0 06 Mar 2021
Continuous Coordination As a Realistic Scenario for Lifelong Learning Hadi Nekoei Akilesh Badrinaaraayanan Aaron Courville Sarath Chandar CLL OffRL 65 41 0 04 Mar 2021
Open Problems in Cooperative AI Allan Dafoe Edward Hughes Yoram Bachrach Tantum Collins Kevin R. McKee Joel Z Leibo Kate Larson T. Graepel 98 202 0 15 Dec 2020
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian Jack Parker-Holder Luke Metz Cinjon Resnick Hengyuan Hu Adam Lerer Alistair Letcher A. Peysakhovich Aldo Pacchiano Jakob N. Foerster 35 24 0 12 Nov 2020
Multi-Agent Determinantal Q-Learning Yaodong Yang Ying Wen Lihuan Chen Jun Wang Kun Shao D. Mguni Weinan Zhang 50 73 0 02 Jun 2020
"Other-Play" for Zero-Shot Coordination Hengyuan Hu Adam Lerer A. Peysakhovich Jakob N. Foerster VLM OffRL 164 221 0 06 Mar 2020
Provable Self-Play Algorithms for Competitive Reinforcement Learning Yu Bai Chi Jin SSL 137 149 0 10 Feb 2020
Effective Diversity in Population Based Reinforcement Learning Jack Parker-Holder Aldo Pacchiano K. Choromanski Stephen J. Roberts 97 162 0 03 Feb 2020
Improving Policies via Search in Cooperative Partially Observable Games Adam Lerer Hengyuan Hu Jakob N. Foerster Noam Brown 59 78 0 05 Dec 2019
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning Hengyuan Hu Jakob N. Foerster OffRL 50 82 0 04 Dec 2019
On the Utility of Learning about Humans for Human-AI Coordination Micah Carroll Rohin Shah Mark K. Ho Thomas Griffiths Sanjit A. Seshia Pieter Abbeel Anca Dragan HAI 67 394 0 13 Oct 2019
Correlation Priors for Reinforcement Learning Bastian Alt Adrian Šošić Heinz Koeppl OffRL 20 12 0 11 Sep 2019
Multi-Agent Adversarial Inverse Reinforcement Learning Lantao Yu Jiaming Song Stefano Ermon 51 134 0 30 Jul 2019
Diverse Agents for Ad-Hoc Cooperation in Hanabi Rodrigo Canaan Julian Togelius Andy Nealen Stefan Menzel 33 46 0 08 Jul 2019
The Hanabi Challenge: A New Frontier for AI Research Nolan Bard Jakob N. Foerster A. Chandar Neil Burch Marc Lanctot ... Iain Dunning Shibl Mourad Hugo Larochelle Marc G. Bellemare Michael Bowling LLMAG 62 352 0 01 Feb 2019
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning Jakob N. Foerster H. F. Song Edward Hughes Neil Burch Iain Dunning Shimon Whiteson M. Botvinick Michael Bowling 59 148 0 04 Nov 2018
Minibatch Gibbs Sampling on Large Graphical Models Christopher De Sa Vincent Chen W. Wong 45 20 0 15 Jun 2018
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents Edoardo Conti Vashisht Madhavan F. Such Joel Lehman Kenneth O. Stanley Jeff Clune 61 347 0 18 Dec 2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai ... D. Kumaran T. Graepel Timothy Lillicrap Karen Simonyan Demis Hassabis 139 1,771 0 05 Dec 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning Matteo Hessel Joseph Modayil H. V. Hasselt Tom Schaul Georg Ostrovski Will Dabney Dan Horgan Bilal Piot M. G. Azar David Silver OffRL 107 2,264 0 06 Oct 2017
Prosocial learning agents solve generalized Stag Hunts better than selfish ones A. Peysakhovich Adam Lerer 63 108 0 08 Sep 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 478 19,019 0 20 Jul 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments Ryan J. Lowe Yi Wu Aviv Tamar J. Harb Pieter Abbeel Igor Mordatch 140 4,482 0 07 Jun 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations Igor Mordatch Pieter Abbeel LLMAG 113 701 0 15 Mar 2017
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker Matej Moravcík Martin Schmid Neil Burch Viliam Lisý Dustin Morrill Nolan Bard Trevor Davis Kevin Waugh Michael Bradley Johanson Michael Bowling BDL 155 908 0 06 Jan 2017
Learning Multiagent Communication with Backpropagation Sainbayar Sukhbaatar Arthur Szlam Rob Fergus 214 1,146 0 25 May 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning Jakob N. Foerster Yannis Assael Nando de Freitas Shimon Whiteson 152 1,607 0 21 May 2016
Illuminating search spaces by mapping elites Jean-Baptiste Mouret Jeff Clune 74 734 0 20 Apr 2015
A Compilation Target for Probabilistic Programming Languages Brooks Paige Frank Wood 56 80 0 03 Mar 2014
Regret Minimization in Non-Zero-Sum Games with Applications to Building Champion Multiplayer Computer Poker Agents Richard G. Gibson 66 15 0 30 Apr 2013
The Complexity of Decentralized Control of Markov Decision Processes D. Bernstein S. Zilberstein N. Immerman 103 1,590 0 16 Jan 2013
Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret R. Arora O. Dekel Ambuj Tewari OffRL 78 195 0 27 Jun 2012
A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning E. Brochu Vlad M. Cora Nando de Freitas GP 136 2,447 0 12 Dec 2010