1802.06070
Diversity is All You Need: Learning Skills without a Reward Function
16 February 2018
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
Papers citing
"Diversity is All You Need: Learning Skills without a Reward Function"
50 / 263 papers shown
Title
Diverse Policies Converge in Reward-free Markov Decision Processes
Fanqing Lin
Shiyu Huang
Weiwei Tu
30
0
0
23 Aug 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
35
5
0
22 Aug 2023
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Xiaoyu Huang
Dhruv Batra
Akshara Rai
Andrew Szot
LM&Ro
38
21
0
19 Aug 2023
QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration
Félix Chalumeau
Bryan Lim
Raphael Boige
Maxime Allard
Luca Grillotti
Manon Flageat
Valentin Macé
Arthur Flajolet
Thomas Pierrot
Antoine Cully
31
21
0
07 Aug 2023
Wasserstein Diversity-Enriched Regularizer for Hierarchical Reinforcement Learning
Haorui Li
Jiaqi Liang
Linjing Li
D. Zeng
16
0
0
02 Aug 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
34
5
0
26 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
Fahad Shahbaz Khan
VLM
40
119
0
25 Jul 2023
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Shanqi Liu
Weiwei Liu
Wenzhou Chen
Guanzhong Tian
Y. Liu
35
0
0
05 Jul 2023
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
Jesse Zhang
Karl Pertsch
Jiahui Zhang
Joseph J. Lim
LM&Ro
45
17
0
20 Jun 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
36
1
0
28 May 2023
Selection for short-term empowerment accelerates the evolution of homeostatic neural cellular automata
Caitlin Grasso
Josh Bongard
31
0
0
24 May 2023
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAG
LM&Ro
33
22
0
21 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
35
0
0
21 May 2023
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
27
3
0
30 Apr 2023
Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions
Lina Mezghani
Piotr Bojanowski
Alahari Karteek
Sainbayar Sukhbaatar
LM&Ro
OffRL
LRM
23
8
0
18 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
38
6
0
14 Apr 2023
Chain-of-Thought Predictive Control
Zhiwei Jia
Vineet Thumuluri
Fangchen Liu
Ling-Hao Chen
Zhiao Huang
H. Su
LM&Ro
41
20
0
03 Apr 2023
Learning to Explore Informative Trajectories and Samples for Embodied Perception
Ya Jing
Tao Kong
27
5
0
20 Mar 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs
Yuan Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
41
8
0
20 Mar 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
29
0
0
15 Mar 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
51
10
0
03 Mar 2023
Scalable Multi-Agent Reinforcement Learning with General Utilities
Donghao Ying
Yuhao Ding
Alec Koppel
Javad Lavaei
40
1
0
15 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
68
10
0
11 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Layered State Discovery for Incremental Autonomous Exploration
Liyu Chen
Andrea Tirinzoni
A. Lazaric
Matteo Pirotta
34
0
0
07 Feb 2023
Developing Driving Strategies Efficiently: A Skill-Based Hierarchical Reinforcement Learning Approach
Yigit Gurses
Kaan Buyukdemirci
Y. Yildiz
31
5
0
04 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordatch
Joel Z Leibo
26
2
0
02 Feb 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization
D. Grytskyy
Jorge Ramírez-Ruiz
R. Moreno-Bote
22
3
0
02 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
42
62
0
02 Feb 2023
Learning Roles with Emergent Social Value Orientations
Wenhao Li
Xiangfeng Wang
Bo Jin
J. Lu
H. Zha
23
3
0
31 Jan 2023
Skill Decision Transformer
Shyam Sudhakaran
S. Risi
OffRL
29
5
0
31 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
26
19
0
26 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
49
18
0
05 Jan 2023
Self-Motivated Multi-Agent Exploration
Shaowei Zhang
Jiahan Cao
Lei Yuan
Yang Yu
De-Chuan Zhan
47
5
0
05 Jan 2023
Intrinsic Motivation in Dynamical Control Systems
Stas Tiomkin
I. Nemenman
Daniel Polani
Naftali Tishby
26
5
0
29 Dec 2022
Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
Xiaodong Li
Pangjing Wu
Chenxin Zou
Qing Li
27
3
0
11 Dec 2022
Assistive Teaching of Motor Control Tasks to Humans
Megha Srivastava
Erdem Biyik
Suvir Mirchandani
Noah D. Goodman
Dorsa Sadigh
20
6
0
25 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
23
4
0
22 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Hierarchically Structured Task-Agnostic Continual Learning
Heinke Hihn
Daniel A. Braun
BDL
CLL
23
8
0
14 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
27
8
0
11 Nov 2022
Emergency action termination for immediate reaction in hierarchical reinforcement learning
Michal Bortkiewicz
Jakub Lyskawa
Paweł Wawrzyński
M. Ostaszewski
Artur Grudkowski
Tomasz Trzciński
24
0
0
11 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
45
7
0
09 Nov 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
36
3
0
28 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktäschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
Augmentative Topology Agents For Open-Ended Learning
Muhammad Umair Nasir
Michael Beukman
Steven D. James
C. Cleghorn
39
3
0
20 Oct 2022
Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity
Maxime Allard
Simón C. Smith
Konstantinos Chatzilygeroudis
Bryan Lim
Antoine Cully
27
13
0
18 Oct 2022