Stabilizing Q Learning Via Soft Mellowmax Operator

17 December 2020

Papers citing "Stabilizing Q Learning Via Soft Mellowmax Operator"

Title | Authors | Tag | Counts | Date
The StarCraft Multi-Agent Challenge | Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip Torr, Jakob N. Foerster, Shimon Whiteson | | 77 947 0 | 11 Feb 2019
Sparse Markov Decision Processes with Causal Sparse Tsallis Entropy Regularization for Reinforcement Learning | Kyungjae Lee, Sungjoon Choi, Songhwai Oh | | 44 67 0 | 19 Sep 2017
Deep Reinforcement Learning that Matters | Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina Precup, David Meger | OffRL | 103 1,940 0 | 19 Sep 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning | P. Sunehag, Guy Lever, A. Gruslys, Wojciech M. Czarnecki, V. Zambaldi, ..., Marc Lanctot, Nicolas Sonnerat, Joel Z Leibo, K. Tuyls, T. Graepel | | 64 997 0 | 16 Jun 2017
Deep Reinforcement Learning with Double Q-learning | H. V. Hasselt, A. Guez, David Silver | OffRL | 131 7,590 0 | 22 Sep 2015
Continuous control with deep reinforcement learning | Timothy Lillicrap, Jonathan J. Hunt, Alexander Pritzel, N. Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra | | 191 13,174 0 | 09 Sep 2015
Optimal and Approximate Q-value Functions for Decentralized POMDPs | F. Oliehoek, M. Spaan, N. Vlassis | OffRL | 94 494 0 | 31 Oct 2011