Stabilizing Q Learning Via Soft Mellowmax Operator

17 December 2020

Papers citing "Stabilizing Q Learning Via Soft Mellowmax Operator"

Title | Authors | Tag | Counts | Date
The StarCraft Multi-Agent Challenge | Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip Torr, Jakob N. Foerster, Shimon Whiteson | | 74 941 0 | 11 Feb 2019
Sparse Markov Decision Processes with Causal Sparse Tsallis Entropy Regularization for Reinforcement Learning | Kyungjae Lee, Sungjoon Choi, Songhwai Oh | | 38 67 0 | 19 Sep 2017
Deep Reinforcement Learning that Matters | Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina Precup, David Meger | OffRL | 98 1,940 0 | 19 Sep 2017
Deep Reinforcement Learning with Double Q-learning | H. V. Hasselt, A. Guez, David Silver | OffRL | 123 7,590 0 | 22 Sep 2015
Continuous control with deep reinforcement learning | Timothy Lillicrap, Jonathan J. Hunt, Alexander Pritzel, N. Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra | | 171 13,174 0 | 09 Sep 2015
Optimal and Approximate Q-value Functions for Decentralized POMDPs | F. Oliehoek, M. Spaan, N. Vlassis | OffRL | 83 494 0 | 31 Oct 2011