
v1v2 (latest)
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Papers citing "Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning"
22 / 22 papers shown
Title |
---|
![]() Relative Entropy Regularized Policy Iteration A. Abdolmaleki Jost Tobias Springenberg Jonas Degrave Steven Bohez Yuval Tassa Dan Belov N. Heess Martin Riedmiller |