Title |
---|
![]() Replay across Experiments: A Natural Extension of Off-Policy RL Dhruva Tirumala Thomas Lampe José Enrique Chen Tuomas Haarnoja Sandy Huang ...Tim Hertweck Leonard Hasenclever Martin Riedmiller N. Heess Markus Wulfmeier |
![]() SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended
Exploration Giulia Vezzani Dhruva Tirumala Markus Wulfmeier Dushyant Rao A. Abdolmaleki ...Tim Hertweck Thomas Lampe Fereshteh Sadeghi N. Heess Martin Riedmiller |