Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach

Papers citing "Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach"

Maximum a Posteriori Policy Optimisation
A. Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Rémi Munos, N. Heess, Martin Riedmiller
81
478
0
14 Jun 2018
