Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
Versions: v1, v2 (latest)

Michael Bloesch
Jost Tobias Springenberg
Martin Riedmiller

Papers citing "Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning"

Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
14 Jun 2018
