First return, then explore

27 April 2020

Jeff Clune

Papers citing "First return, then explore"

21 / 71 papers shown

Title
Divide & Conquer Imitation Learning Alexandre Chenu Nicolas Perrin-Gilbert Olivier Sigaud 8 5 0 15 Apr 2022
Hierarchical Quality-Diversity for Online Damage Recovery Maxime Allard Simón C. Smith Konstantinos Chatzilygeroudis Antoine Cully 17 12 0 12 Apr 2022
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets Yunfei Li Tao Kong Lei Li Yi Wu 43 4 0 12 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained Representations Allison C. Tam Neil C. Rabinowitz Andrew Kyle Lampinen Nicholas A. Roy Stephanie C. Y. Chan D. Strouse Jane X. Wang Andrea Banino Felix Hill LM&Ro 30 67 0 08 Apr 2022
Jump-Start Reinforcement Learning Ikechukwu Uchendu Ted Xiao Yao Lu Banghua Zhu Mengyuan Yan ... Chuyuan Fu Cong Ma Jiantao Jiao Sergey Levine Karol Hausman OffRL OnRL 33 109 0 05 Apr 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation Zhao Yang Thomas M. Moerland Mike Preuss Aske Plaat 21 1 0 29 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share? Alain Andres Esther Villar-Rodriguez Javier Del Ser 12 9 0 24 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 36 9 0 23 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions Robert Meier Asier Mujika 37 7 0 16 Feb 2022
Interpretable pipelines with evolutionarily optimized modules for RL tasks with visual inputs Leonardo Lucio Custode Giovanni Iacca 20 13 0 10 Feb 2022
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software Chuan-Yung Tsai Graham W. Taylor 6 2 0 29 Jan 2022
Provable Hierarchy-Based Meta-Reinforcement Learning Kurtland Chua Qi Lei Jason D. Lee 22 5 0 18 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 36 92 0 14 Sep 2021
Active Reinforcement Learning over MDPs Qi Yang Peng Yang K. Tang 35 0 0 05 Aug 2021
Differentiable Quality Diversity Matthew C. Fontaine Stefanos Nikolaidis 40 89 0 07 Jun 2021
Monte Carlo Elites: Quality-Diversity Selection as a Multi-Armed Bandit Problem Konstantinos Sfikas Antonios Liapis Georgios N. Yannakakis 9 20 0 18 Apr 2021
Reinforcement learning for optimization of variational quantum circuit architectures M. Ostaszewski Lea M. Trenkwalder Wojciech Masarczyk Eleanor Scerri Vedran Dunjko 27 135 0 30 Mar 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation OpenAI OpenAI Matthias Plappert Raul Sampedro Tao Xu Ilge Akkaya ... Hyeonwoo Noh Lilian Weng Qiming Yuan Casey Chu Wojciech Zaremba SSL 79 76 0 13 Jan 2021
BeBold: Exploration Beyond the Boundary of Explored Regions Tianjun Zhang Huazhe Xu Xiaolong Wang Yi Wu Kurt Keutzer Joseph E. Gonzalez Yuandong Tian 28 40 0 15 Dec 2020
A Unifying Framework for Reinforcement Learning and Planning Thomas M. Moerland Joost Broekens Aske Plaat Catholijn M. Jonker OffRL 25 9 0 26 Jun 2020
Should artificial agents ask for help in human-robot collaborative problem-solving? Adrien Bennetot V. Charisi Natalia Díaz Rodríguez 21 8 0 25 May 2020