arXiv:2012.09456
Stabilizing Q Learning Via Soft Mellowmax Operator
17 December 2020
Yaozhong Gan
Zhe Zhang
Xiaoyang Tan
Papers citing "Stabilizing Q Learning Via Soft Mellowmax Operator"
7 / 7 papers shown
Title
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
77
947
0
11 Feb 2019
Sparse Markov Decision Processes with Causal Sparse Tsallis Entropy Regularization for Reinforcement Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
44
67
0
19 Sep 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
103
1,940
0
19 Sep 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
64
997
0
16 Jun 2017
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
131
7,590
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
191
13,174
0
09 Sep 2015
Optimal and Approximate Q-value Functions for Decentralized POMDPs
F. Oliehoek
M. Spaan
N. Vlassis
OffRL
94
494
0
31 Oct 2011