Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.08926
Cited By
v1
v2 (latest)
Counterfactual Multi-Agent Policy Gradients
24 May 2017
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Counterfactual Multi-Agent Policy Gradients"
2 / 52 papers shown
Title
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Lex Weaver
Nigel Tao
119
249
0
10 Jan 2013
Optimal and Approximate Q-value Functions for Decentralized POMDPs
F. Oliehoek
M. Spaan
N. Vlassis
OffRL
116
500
0
31 Oct 2011
Previous
1
2