ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.08015
  4. Cited By
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination

On-the-fly Strategy Adaptation for ad-hoc Agent Coordination

8 March 2022
Jaleh Zand
Jack Parker-Holder
Stephen J. Roberts
ArXivPDFHTML

Papers citing "On-the-fly Strategy Adaptation for ad-hoc Agent Coordination"

34 / 34 papers shown
Title
Collaborating with Humans without Human Data
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
154
167
0
15 Oct 2021
Off-Belief Learning
Off-Belief Learning
Hengyuan Hu
Adam Lerer
Brandon Cui
David J. Wu
Luis Pineda
Noam Brown
Jakob N. Foerster
OffRL
36
71
0
06 Mar 2021
Continuous Coordination As a Realistic Scenario for Lifelong Learning
Continuous Coordination As a Realistic Scenario for Lifelong Learning
Hadi Nekoei
Akilesh Badrinaaraayanan
Aaron Courville
Sarath Chandar
CLL
OffRL
65
41
0
04 Mar 2021
Open Problems in Cooperative AI
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
98
202
0
15 Dec 2020
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the
  Hessian
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
Jack Parker-Holder
Luke Metz
Cinjon Resnick
Hengyuan Hu
Adam Lerer
Alistair Letcher
A. Peysakhovich
Aldo Pacchiano
Jakob N. Foerster
35
24
0
12 Nov 2020
Multi-Agent Determinantal Q-Learning
Multi-Agent Determinantal Q-Learning
Yaodong Yang
Ying Wen
Lihuan Chen
Jun Wang
Kun Shao
D. Mguni
Weinan Zhang
50
73
0
02 Jun 2020
"Other-Play" for Zero-Shot Coordination
"Other-Play" for Zero-Shot Coordination
Hengyuan Hu
Adam Lerer
A. Peysakhovich
Jakob N. Foerster
VLM
OffRL
164
221
0
06 Mar 2020
Provable Self-Play Algorithms for Competitive Reinforcement Learning
Provable Self-Play Algorithms for Competitive Reinforcement Learning
Yu Bai
Chi Jin
SSL
137
149
0
10 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
97
162
0
03 Feb 2020
Improving Policies via Search in Cooperative Partially Observable Games
Improving Policies via Search in Cooperative Partially Observable Games
Adam Lerer
Hengyuan Hu
Jakob N. Foerster
Noam Brown
59
78
0
05 Dec 2019
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
Hengyuan Hu
Jakob N. Foerster
OffRL
50
82
0
04 Dec 2019
On the Utility of Learning about Humans for Human-AI Coordination
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
Sanjit A. Seshia
Pieter Abbeel
Anca Dragan
HAI
67
394
0
13 Oct 2019
Correlation Priors for Reinforcement Learning
Correlation Priors for Reinforcement Learning
Bastian Alt
Adrian Šošić
Heinz Koeppl
OffRL
20
12
0
11 Sep 2019
Multi-Agent Adversarial Inverse Reinforcement Learning
Multi-Agent Adversarial Inverse Reinforcement Learning
Lantao Yu
Jiaming Song
Stefano Ermon
51
134
0
30 Jul 2019
Diverse Agents for Ad-Hoc Cooperation in Hanabi
Diverse Agents for Ad-Hoc Cooperation in Hanabi
Rodrigo Canaan
Julian Togelius
Andy Nealen
Stefan Menzel
33
46
0
08 Jul 2019
The Hanabi Challenge: A New Frontier for AI Research
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
62
352
0
01 Feb 2019
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
H. F. Song
Edward Hughes
Neil Burch
Iain Dunning
Shimon Whiteson
M. Botvinick
Michael Bowling
59
148
0
04 Nov 2018
Minibatch Gibbs Sampling on Large Graphical Models
Minibatch Gibbs Sampling on Large Graphical Models
Christopher De Sa
Vincent Chen
W. Wong
45
20
0
15 Jun 2018
Improving Exploration in Evolution Strategies for Deep Reinforcement
  Learning via a Population of Novelty-Seeking Agents
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
61
347
0
18 Dec 2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement
  Learning Algorithm
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
139
1,771
0
05 Dec 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
107
2,264
0
06 Oct 2017
Prosocial learning agents solve generalized Stag Hunts better than
  selfish ones
Prosocial learning agents solve generalized Stag Hunts better than selfish ones
A. Peysakhovich
Adam Lerer
63
108
0
08 Sep 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
478
19,019
0
20 Jul 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
140
4,482
0
07 Jun 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
113
701
0
15 Mar 2017
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
155
908
0
06 Jan 2017
Learning Multiagent Communication with Backpropagation
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
214
1,146
0
25 May 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
152
1,607
0
21 May 2016
Illuminating search spaces by mapping elites
Illuminating search spaces by mapping elites
Jean-Baptiste Mouret
Jeff Clune
74
734
0
20 Apr 2015
A Compilation Target for Probabilistic Programming Languages
A Compilation Target for Probabilistic Programming Languages
Brooks Paige
Frank Wood
56
80
0
03 Mar 2014
Regret Minimization in Non-Zero-Sum Games with Applications to Building
  Champion Multiplayer Computer Poker Agents
Regret Minimization in Non-Zero-Sum Games with Applications to Building Champion Multiplayer Computer Poker Agents
Richard G. Gibson
66
15
0
30 Apr 2013
The Complexity of Decentralized Control of Markov Decision Processes
The Complexity of Decentralized Control of Markov Decision Processes
D. Bernstein
S. Zilberstein
N. Immerman
103
1,590
0
16 Jan 2013
Online Bandit Learning against an Adaptive Adversary: from Regret to
  Policy Regret
Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret
R. Arora
O. Dekel
Ambuj Tewari
OffRL
78
195
0
27 Jun 2012
A Tutorial on Bayesian Optimization of Expensive Cost Functions, with
  Application to Active User Modeling and Hierarchical Reinforcement Learning
A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning
E. Brochu
Vlad M. Cora
Nando de Freitas
GP
136
2,447
0
12 Dec 2010
1