ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.06070
  4. Cited By
Diversity is All You Need: Learning Skills without a Reward Function

Diversity is All You Need: Learning Skills without a Reward Function

16 February 2018
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
ArXivPDFHTML

Papers citing "Diversity is All You Need: Learning Skills without a Reward Function"

50 / 263 papers shown
Title
Diverse Policies Converge in Reward-free Markov Decision Processe
Diverse Policies Converge in Reward-free Markov Decision Processe
Fanqing Lin
Shiyu Huang
Weiwei Tu
30
0
0
23 Aug 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
35
5
0
22 Aug 2023
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Xiaoyu Huang
Dhruv Batra
Akshara Rai
Andrew Szot
LM&Ro
38
21
0
19 Aug 2023
QDax: A Library for Quality-Diversity and Population-based Algorithms
  with Hardware Acceleration
QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration
Félix Chalumeau
Bryan Lim
Raphael Boige
Maxime Allard
Luca Grillotti
Manon Flageat
Valentin Macé
Arthur Flajolet
Thomas Pierrot
Antoine Cully
31
21
0
07 Aug 2023
Wasserstein Diversity-Enriched Regularizer for Hierarchical
  Reinforcement Learning
Wasserstein Diversity-Enriched Regularizer for Hierarchical Reinforcement Learning
Haorui Li
Jiaqi Liang
Linjing Li
D. Zeng
16
0
0
02 Aug 2023
Reinforcement Learning by Guided Safe Exploration
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
34
5
0
26 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
Fahad Shahbaz Khan
VLM
40
119
0
25 Jul 2023
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Shanqi Liu
Weiwei Liu
Wenzhou Chen
Guanzhong Tian
Y. Liu
35
0
0
05 Jul 2023
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
Jesse Zhang
Karl Pertsch
Jiahui Zhang
Joseph J. Lim
LM&Ro
45
17
0
20 Jun 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
36
1
0
28 May 2023
Selection for short-term empowerment accelerates the evolution of
  homeostatic neural cellular automata
Selection for short-term empowerment accelerates the evolution of homeostatic neural cellular automata
Caitlin Grasso
Josh Bongard
31
0
0
24 May 2023
Augmenting Autotelic Agents with Large Language Models
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAG
LM&Ro
33
22
0
21 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
35
0
0
21 May 2023
Learning Achievement Structure for Structured Exploration in Domains
  with Sparse Reward
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
27
3
0
30 Apr 2023
Think Before You Act: Unified Policy for Interleaving Language Reasoning
  with Actions
Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions
Lina Mezghani
Piotr Bojanowski
Alahari Karteek
Sainbayar Sukhbaatar
LM&Ro
OffRL
LRM
23
8
0
18 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
38
6
0
14 Apr 2023
Chain-of-Thought Predictive Control
Chain-of-Thought Predictive Control
Zhiwei Jia
Vineet Thumuluri
Fangchen Liu
Ling-Hao Chen
Zhiao Huang
H. Su
LM&Ro
41
20
0
03 Apr 2023
Learning to Explore Informative Trajectories and Samples for Embodied
  Perception
Learning to Explore Informative Trajectories and Samples for Embodied Perception
Ya Jing
Tao Kong
27
5
0
20 Mar 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under
  Low-rank MDPs
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs
Yuan Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
41
8
0
20 Mar 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep
  Reinforcement Learning
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
29
0
0
15 Mar 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
Guarded Policy Optimization with Imperfect Online Demonstrations
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
51
10
0
03 Mar 2023
Scalable Multi-Agent Reinforcement Learning with General Utilities
Scalable Multi-Agent Reinforcement Learning with General Utilities
Donghao Ying
Yuhao Ding
Alec Koppel
Javad Lavaei
40
1
0
15 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
68
10
0
11 Feb 2023
Investigating the role of model-based learning in exploration and
  transfer
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Layered State Discovery for Incremental Autonomous Exploration
Layered State Discovery for Incremental Autonomous Exploration
Liyu Chen
Andrea Tirinzoni
A. Lazaric
Matteo Pirotta
34
0
0
07 Feb 2023
Developing Driving Strategies Efficiently: A Skill-Based Hierarchical
  Reinforcement Learning Approach
Developing Driving Strategies Efficiently: A Skill-Based Hierarchical Reinforcement Learning Approach
Yigit Gurses
Kaan Buyukdemirci
Y. Yildiz
31
5
0
04 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for
  Reinforcement Learning through Value-Decomposition
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
26
2
0
02 Feb 2023
A general Markov decision process formalism for action-state
  entropy-regularized reward maximization
A general Markov decision process formalism for action-state entropy-regularized reward maximization
D. Grytskyy
Jorge Ramírez-Ruiz
R. Moreno-Bote
22
3
0
02 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
42
62
0
02 Feb 2023
Learning Roles with Emergent Social Value Orientations
Learning Roles with Emergent Social Value Orientations
Wenhao Li
Xiangfeng Wang
Bo Jin
J. Lu
H. Zha
23
3
0
31 Jan 2023
Skill Decision Transformer
Skill Decision Transformer
Shyam Sudhakaran
S. Risi
OffRL
29
5
0
31 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
26
19
0
26 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward
  Shaping
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
49
18
0
05 Jan 2023
Self-Motivated Multi-Agent Exploration
Self-Motivated Multi-Agent Exploration
Shaowei Zhang
Jiahan Cao
Lei Yuan
Yang Yu
De-Chuan Zhan
47
5
0
05 Jan 2023
Intrinsic Motivation in Dynamical Control Systems
Intrinsic Motivation in Dynamical Control Systems
Stas Tiomkin
I. Nemenman
Daniel Polani
Naftali Tishby
26
5
0
29 Dec 2022
Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
Xiaodong Li
Pangjing Wu
Chenxin Zou
Qing Li
27
3
0
11 Dec 2022
Assistive Teaching of Motor Control Tasks to Humans
Assistive Teaching of Motor Control Tasks to Humans
Megha Srivastava
Erdem Biyik
Suvir Mirchandani
Noah D. Goodman
Dorsa Sadigh
20
6
0
25 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
23
4
0
22 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Hierarchically Structured Task-Agnostic Continual Learning
Hierarchically Structured Task-Agnostic Continual Learning
Heinke Hihn
Daniel A. Braun
BDL
CLL
23
8
0
14 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through
  PRM-Guided Return-Conditioned Sequence Modeling
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
27
8
0
11 Nov 2022
Emergency action termination for immediate reaction in hierarchical
  reinforcement learning
Emergency action termination for immediate reaction in hierarchical reinforcement learning
Michal Bortkiewicz
Jakub Lyskawa
Pawel Wawrzyñski
M. Ostaszewski
Artur Grudkowski
Tomasz Trzciñski
24
0
0
11 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
45
7
0
09 Nov 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward
  Long-Horizon Goal-Conditioned Reinforcement Learning
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
36
3
0
28 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
Augmentative Topology Agents For Open-Ended Learning
Augmentative Topology Agents For Open-Ended Learning
Muhammad Umair Nasir
Michael Beukman
Steven D. James
C. Cleghorn
39
3
0
20 Oct 2022
Online Damage Recovery for Physical Robots with Hierarchical
  Quality-Diversity
Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity
Maxime Allard
Simón C. Smith
Konstantinos Chatzilygeroudis
Bryan Lim
Antoine Cully
27
13
0
18 Oct 2022
Previous
123456
Next