Residual Reinforcement Learning from Demonstrations

Residual Reinforcement Learning from Demonstrations

Papers citing "Residual Reinforcement Learning from Demonstrations"

25 / 25 papers shown
Title
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
69
471
0
14 Jun 2018

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.