ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.06070
  4. Cited By
Diversity is All You Need: Learning Skills without a Reward Function
v1v2v3v4v5v6 (latest)

Diversity is All You Need: Learning Skills without a Reward Function

16 February 2018
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Diversity is All You Need: Learning Skills without a Reward Function"

50 / 414 papers shown
Title
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State
  Entropy Estimate
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
73
19
0
09 Jul 2020
Adaptive Procedural Task Generation for Hard-Exploration Problems
Adaptive Procedural Task Generation for Hard-Exploration Problems
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
78
26
0
01 Jul 2020
Model-based Reinforcement Learning: A Survey
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
120
49
0
30 Jun 2020
SOAC: The Soft Option Actor-Critic Architecture
SOAC: The Soft Option Actor-Critic Architecture
Chenghao Li
Xiaoteng Ma
Chongjie Zhang
Jun Yang
L. Xia
Qianchuan Zhao
50
6
0
25 Jun 2020
Long-Horizon Visual Planning with Goal-Conditioned Hierarchical
  Predictors
Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors
Karl Pertsch
Oleh Rybkin
F. Ebert
Chelsea Finn
Dinesh Jayaraman
Sergey Levine
81
73
0
23 Jun 2020
On Reward-Free Reinforcement Learning with Linear Function Approximation
On Reward-Free Reinforcement Learning with Linear Function Approximation
Ruosong Wang
S. Du
Lin F. Yang
Ruslan Salakhutdinov
OffRL
83
107
0
19 Jun 2020
Semantic Curiosity for Active Visual Learning
Semantic Curiosity for Active Visual Learning
Devendra Singh Chaplot
Helen Jiang
Saurabh Gupta
Abhinav Gupta
ObjD
66
72
0
16 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
75
19
0
14 Jun 2020
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework
Wei Shen
Yuanying Cai
Longbo Huang
Jian Li
OffRL
69
1
0
11 Jun 2020
Continuous Action Reinforcement Learning from a Mixture of Interpretable
  Experts
Continuous Action Reinforcement Learning from a Mixture of Interpretable Experts
R. Akrour
Davide Tateo
Jan Peters
60
22
0
10 Jun 2020
The Emergence of Individuality
The Emergence of Individuality
Jiechuan Jiang
Zongqing Lu
75
40
0
10 Jun 2020
Temporally-Extended ε-Greedy Exploration
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
79
34
0
02 Jun 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
136
13
0
21 May 2020
Simple Sensor Intentions for Exploration
Simple Sensor Intentions for Exploration
Tim Hertweck
Martin Riedmiller
Michael Bloesch
Jost Tobias Springenberg
Noah Y. Siegel
Markus Wulfmeier
Roland Hafner
N. Heess
68
10
0
15 May 2020
Progressive growing of self-organized hierarchical representations for
  exploration
Progressive growing of self-organized hierarchical representations for exploration
Mayalen Etcheverry
Pierre-Yves Oudeyer
Chris Reinke
52
0
0
13 May 2020
Planning to Explore via Self-Supervised World Models
Planning to Explore via Self-Supervised World Models
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
SSL
118
412
0
12 May 2020
Maximizing Information Gain in Partially Observable Environments via
  Prediction Reward
Maximizing Information Gain in Partially Observable Environments via Prediction Reward
Yash Satsangi
Sungsu Lim
Shimon Whiteson
F. Oliehoek
Martha White
80
15
0
11 May 2020
Emergent Real-World Robotic Skills via Unsupervised Off-Policy
  Reinforcement Learning
Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning
Archit Sharma
Michael Ahn
Sergey Levine
Vikash Kumar
Karol Hausman
S. Gu
SSLOffRL
54
47
0
27 Apr 2020
Real World Games Look Like Spinning Tops
Real World Games Look Like Spinning Tops
Wojciech M. Czarnecki
Gauthier Gidel
Brendan D. Tracey
K. Tuyls
Shayegan Omidshafiei
David Balduzzi
Max Jaderberg
75
101
0
20 Apr 2020
Agent57: Outperforming the Atari Human Benchmark
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
109
521
0
30 Mar 2020
Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
167
827
0
19 Mar 2020
Multimodal Trajectory Optimization for Motion Planning
Multimodal Trajectory Optimization for Motion Planning
Takayuki Osa
95
56
0
16 Mar 2020
Option Discovery in the Absence of Rewards with Manifold Analysis
Option Discovery in the Absence of Rewards with Manifold Analysis
Amitay Bar
Ronen Talmon
Ron Meir
71
5
0
12 Mar 2020
Meta-learning curiosity algorithms
Meta-learning curiosity algorithms
Ferran Alet
Martin Schneider
Tomas Lozano-Perez
L. Kaelbling
99
64
0
11 Mar 2020
Automatic Curriculum Learning For Deep RL: A Short Survey
Automatic Curriculum Learning For Deep RL: A Short Survey
Rémy Portelas
Cédric Colas
Lilian Weng
Katja Hofmann
Pierre-Yves Oudeyer
ODL
119
176
0
10 Mar 2020
Hierarchically Decoupled Imitation for Morphological Transfer
Hierarchically Decoupled Imitation for Morphological Transfer
D. Hejna
Pieter Abbeel
Lerrel Pinto
LM&Ro
77
43
0
03 Mar 2020
EGAD! an Evolved Grasping Analysis Dataset for diversity and
  reproducibility in robotic manipulation
EGAD! an Evolved Grasping Analysis Dataset for diversity and reproducibility in robotic manipulation
D. Morrison
Peter Corke
Jurgen Leitner
200
140
0
03 Mar 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated
  Environments
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments
Roberta Raileanu
Tim Rocktaschel
88
174
0
27 Feb 2020
Generalized Hindsight for Reinforcement Learning
Generalized Hindsight for Reinforcement Learning
Alexander C. Li
Lerrel Pinto
Pieter Abbeel
67
70
0
26 Feb 2020
Generating Automatic Curricula via Self-Supervised Active Domain
  Randomization
Generating Automatic Curricula via Self-Supervised Active Domain Randomization
Sharath Chandra Raparthy
Bhairav Mehta
Florian Golemo
Liam Paull
87
8
0
18 Feb 2020
Intrinsic Motivation for Encouraging Synergistic Behavior
Intrinsic Motivation for Encouraging Synergistic Behavior
Rohan Chitnis
Shubham Tulsiani
Saurabh Gupta
Abhinav Gupta
40
28
0
12 Feb 2020
Explore, Discover and Learn: Unsupervised Discovery of State-Covering
  Skills
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Victor Campos
Alexander R. Trott
Caiming Xiong
R. Socher
Xavier Giró-i-Nieto
Jordi Torres
OffRL
101
156
0
10 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
130
165
0
03 Feb 2020
Preventing Imitation Learning with Adversarial Policy Ensembles
Preventing Imitation Learning with Adversarial Policy Ensembles
Albert Zhan
Stas Tiomkin
Pieter Abbeel
31
3
0
31 Jan 2020
Making Sense of Reinforcement Learning and Probabilistic Inference
Making Sense of Reinforcement Learning and Probabilistic Inference
Brendan O'Donoghue
Ian Osband
Catalin Ionescu
OffRL
111
49
0
03 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via
  Reward Network Distillation
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
116
39
0
02 Jan 2020
Entropy Regularization with Discounted Future State Distribution in
  Policy Gradient Methods
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam
Raihan Seraj
Pierre-Luc Bacon
Doina Precup
50
8
0
11 Dec 2019
Unsupervised Curricula for Visual Meta-Reinforcement Learning
Unsupervised Curricula for Visual Meta-Reinforcement Learning
Allan Jabri
Kyle Hsu
Benjamin Eysenbach
Abhishek Gupta
Sergey Levine
Chelsea Finn
VLMOODSSLOffRL
84
65
0
09 Dec 2019
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill
  Discovery
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
Jiachen Yang
Igor Borovikov
H. Zha
104
77
0
07 Dec 2019
Disentangled Cumulants Help Successor Representations Transfer to New
  Tasks
Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Christopher Grimm
I. Higgins
André Barreto
Denis Teplyashin
Markus Wulfmeier
Tim Hertweck
R. Hadsell
Satinder Singh
69
14
0
25 Nov 2019
Bayesian Curiosity for Efficient Exploration in Reinforcement Learning
Bayesian Curiosity for Efficient Exploration in Reinforcement Learning
Tom Blau
Lionel Ott
Fabio Ramos
25
8
0
20 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of
  arbitrary future tasks
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
56
2
0
20 Nov 2019
Implicit Generative Modeling for Efficient Exploration
Implicit Generative Modeling for Efficient Exploration
Neale Ratzlaff
Qinxun Bai
Fuxin Li
Wenyuan Xu
70
12
0
19 Nov 2019
Unsupervised Reinforcement Learning of Transferable Meta-Skills for
  Embodied Navigation
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
Juncheng Li
Xinze Wang
Siliang Tang
Haizhou Shi
Leilei Gan
Yueting Zhuang
William Yang Wang
SSL
99
70
0
18 Nov 2019
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing
  Shaped Rewards
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards
Alexander R. Trott
Stephan Zheng
Caiming Xiong
R. Socher
125
112
0
04 Nov 2019
PODNet: A Neural Network for Discovery of Plannable Options
PODNet: A Neural Network for Discovery of Plannable Options
R. Bera
Vinicius G. Goecks
Gregory M. Gremillion
J. Valasek
Nicholas R. Waytowich
38
4
0
01 Nov 2019
Learning Transferable Graph Exploration
Learning Transferable Graph Exploration
H. Dai
Yujia Li
Chenglong Wang
Rishabh Singh
Po-Sen Huang
Pushmeet Kohli
79
22
0
28 Oct 2019
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and
  Reinforcement Learning
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
118
435
0
25 Oct 2019
Dynamic Subgoal-based Exploration via Bayesian Optimization
Dynamic Subgoal-based Exploration via Bayesian Optimization
Yijia Wang
Matthias Poloczek
Daniel R. Jiang
78
3
0
21 Oct 2019
MAVEN: Multi-Agent Variational Exploration
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan
Tabish Rashid
Mikayel Samvelyan
Shimon Whiteson
DRL
216
366
0
16 Oct 2019
Previous
123456789
Next