ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.12919
  4. Cited By
First return, then explore

First return, then explore

27 April 2020
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
ArXivPDFHTML

Papers citing "First return, then explore"

21 / 71 papers shown
Title
Divide & Conquer Imitation Learning
Divide & Conquer Imitation Learning
Alexandre Chenu
Nicolas Perrin-Gilbert
Olivier Sigaud
8
5
0
15 Apr 2022
Hierarchical Quality-Diversity for Online Damage Recovery
Hierarchical Quality-Diversity for Online Damage Recovery
Maxime Allard
Simón C. Smith
Konstantinos Chatzilygeroudis
Antoine Cully
17
12
0
12 Apr 2022
Learning Design and Construction with Varying-Sized Materials via
  Prioritized Memory Resets
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
43
4
0
12 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained
  Representations
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
30
67
0
08 Apr 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
33
109
0
05 Apr 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in
  Intrinsic Motivation
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation
Zhao Yang
Thomas M. Moerland
Mike Preuss
Aske Plaat
21
1
0
29 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
12
9
0
24 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
Interpretable pipelines with evolutionarily optimized modules for RL
  tasks with visual inputs
Interpretable pipelines with evolutionarily optimized modules for RL tasks with visual inputs
Leonardo Lucio Custode
Giovanni Iacca
20
13
0
10 Feb 2022
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing
  of Software
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software
Chuan-Yung Tsai
Graham W. Taylor
6
2
0
29 Jan 2022
Provable Hierarchy-Based Meta-Reinforcement Learning
Provable Hierarchy-Based Meta-Reinforcement Learning
Kurtland Chua
Qi Lei
Jason D. Lee
22
5
0
18 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
Active Reinforcement Learning over MDPs
Qi Yang
Peng Yang
K. Tang
35
0
0
05 Aug 2021
Differentiable Quality Diversity
Differentiable Quality Diversity
Matthew C. Fontaine
Stefanos Nikolaidis
40
89
0
07 Jun 2021
Monte Carlo Elites: Quality-Diversity Selection as a Multi-Armed Bandit
  Problem
Monte Carlo Elites: Quality-Diversity Selection as a Multi-Armed Bandit Problem
Konstantinos Sfikas
Antonios Liapis
Georgios N. Yannakakis
9
20
0
18 Apr 2021
Reinforcement learning for optimization of variational quantum circuit
  architectures
Reinforcement learning for optimization of variational quantum circuit architectures
M. Ostaszewski
Lea M. Trenkwalder
Wojciech Masarczyk
Eleanor Scerri
Vedran Dunjko
27
135
0
30 Mar 2021
Asymmetric self-play for automatic goal discovery in robotic
  manipulation
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
79
76
0
13 Jan 2021
BeBold: Exploration Beyond the Boundary of Explored Regions
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
28
40
0
15 Dec 2020
A Unifying Framework for Reinforcement Learning and Planning
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
25
9
0
26 Jun 2020
Should artificial agents ask for help in human-robot collaborative
  problem-solving?
Should artificial agents ask for help in human-robot collaborative problem-solving?
Adrien Bennetot
V. Charisi
Natalia Díaz Rodríguez
21
8
0
25 May 2020
Previous
12