ResearchTrend.AI
Stabilizing Q Learning Via Soft Mellowmax Operator

17 December 2020
Yaozhong Gan
Zhe Zhang
Xiaoyang Tan

Papers citing "Stabilizing Q Learning Via Soft Mellowmax Operator"

6 / 6 papers shown
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
74
941
0
11 Feb 2019
Sparse Markov Decision Processes with Causal Sparse Tsallis Entropy Regularization for Reinforcement Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
38
67
0
19 Sep 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
98
1,940
0
19 Sep 2017
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
123
7,590
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
171
13,174
0
09 Sep 2015
Optimal and Approximate Q-value Functions for Decentralized POMDPs
F. Oliehoek
M. Spaan
N. Vlassis
OffRL
83
494
0
31 Oct 2011