
Muesli: Combining Improvements in Policy Optimization
Ivo Danihelka
David Silver
Papers citing "Muesli: Combining Improvements in Policy Optimization"
Title | Authors |
---|---|
Local Search for Policy Iteration in Continuous Control | Jost Tobias Springenberg, N. Heess, D. Mankowitz, J. Merel, Arunkumar Byravan, ..., Julian Schrittwieser, Yuval Tassa, J. Buchli, Dan Belov, Martin Riedmiller |
Soft Actor-Critic Algorithms and Applications | Tuomas Haarnoja, Aurick Zhou, Kristian Hartikainen, George Tucker, Sehoon Ha, ..., Vikash Kumar, Henry Zhu, Abhishek Gupta, Pieter Abbeel, Sergey Levine |
Observe and Look Further: Achieving Consistent Performance on Atari | Tobias Pohlen, Bilal Piot, Todd Hester, M. G. Azar, Dan Horgan, ..., John Quan, Mel Vecerík, Matteo Hessel, Rémi Munos, Olivier Pietquin |