On the Value of Myopic Behavior in Policy ReuseIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 |
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended
Exploration Giulia Vezzani Dhruva Tirumala Markus Wulfmeier Dushyant Rao A. Abdolmaleki ...Tim Hertweck Thomas Lampe Fereshteh Sadeghi N. Heess Martin Riedmiller |
CUP: Critic-Guided Policy ReuseNeural Information Processing Systems (NeurIPS), 2022 |
Efficient Use of heuristics for accelerating XCS-based Policy Learning
in Markov GamesSwarm and Evolutionary Computation (Swarm Evol. Comput.), 2020 |