Title |
---|
![]() Decoupled Exploration and Exploitation Policies for Sample-Efficient
Reinforcement Learning William F. Whitney Michael Bloesch Jost Tobias Springenberg A. Abdolmaleki Kyunghyun Cho Martin Riedmiller |
![]() Local Search for Policy Iteration in Continuous Control Jost Tobias Springenberg N. Heess D. Mankowitz J. Merel Arunkumar Byravan ...Julian Schrittwieser Yuval Tassa J. Buchli Dan Belov Martin Riedmiller |
![]() Relative Entropy Regularized Policy Iteration A. Abdolmaleki Jost Tobias Springenberg Jonas Degrave Steven Bohez Yuval Tassa Dan Belov N. Heess Martin Riedmiller |