Title |
---|
![]() Data-efficient Hindsight Off-policy Option Learning Markus Wulfmeier Dushyant Rao Roland Hafner Thomas Lampe A. Abdolmaleki ...Michael Neunert Dhruva Tirumala Noah Y. Siegel N. Heess Martin Riedmiller |
![]() Least-Squares Temporal Difference Learning for the Linear Quadratic
Regulator Stephen Tu Benjamin Recht |