Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.06070
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Diversity is All You Need: Learning Skills without a Reward Function
16 February 2018
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Diversity is All You Need: Learning Skills without a Reward Function"
50 / 414 papers shown
Title
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
Siyuan Li
Rui Wang
Minxue Tang
Chongjie Zhang
77
83
0
10 Oct 2019
On the Possibility of Rewarding Structure Learning Agents: Mutual Information on Linguistic Random Sets
Ignacio Arroyo-Fernández
Mauricio Carrasco-Ruiz
J. A. Arias-Aguilar
42
0
0
09 Oct 2019
Automated curricula through setter-solver interactions
S. Racanière
Andrew Kyle Lampinen
Adam Santoro
David P. Reichert
Vlad Firoiu
Timothy Lillicrap
81
53
0
27 Sep 2019
Hierarchical Foresight: Self-Supervised Learning of Long-Horizon Tasks via Visual Subgoal Generation
Suraj Nair
Chelsea Finn
VGen
89
138
0
12 Sep 2019
Unsupervised Learning and Exploration of Reachable Outcome Space
Giuseppe Paolo
Alban Laflaquière
Alexandre Coninx
Stéphane Doncieux
93
37
0
12 Sep 2019
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
81
85
0
10 Sep 2019
Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning
Liheng Chen
Hongyi Guo
Yali Du
Fei Fang
Haifeng Zhang
Yaoming Zhu
Ming Zhou
Weinan Zhang
Qing Wang
Yong Yu
56
8
0
10 Sep 2019
A survey on intrinsic motivation in reinforcement learning
A. Aubret
L. Matignon
S. Hassas
AI4CE
112
144
0
19 Aug 2019
IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL
Nirbhay Modhe
Prithvijit Chattopadhyay
Mohit Sharma
Abhishek Das
Devi Parikh
Dhruv Batra
Ramakrishna Vedantam
54
1
0
24 Jul 2019
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards
Yijie Guo
Jongwook Choi
Marcin Moczulski
Shengyu Feng
Samy Bengio
Mohammad Norouzi
Honglak Lee
86
10
0
24 Jul 2019
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery
Kristian Hartikainen
Xinyang Geng
Tuomas Haarnoja
Sergey Levine
SSL
103
82
0
18 Jul 2019
On the Weaknesses of Reinforcement Learning for Neural Machine Translation
Leshem Choshen
Lior Fox
Zohar Aizenbud
Omri Abend
131
110
0
03 Jul 2019
Dynamics-Aware Unsupervised Discovery of Skills
Archit Sharma
S. Gu
Sergey Levine
Vikash Kumar
Karol Hausman
130
414
0
02 Jul 2019
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Anirudh Goyal
Shagun Sodhani
Jonathan Binas
Xue Bin Peng
Sergey Levine
Yoshua Bengio
91
49
0
25 Jun 2019
Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction
Fengda Zhu
Xiaojun Chang
Runhao Zeng
Mingkui Tan
CLL
52
3
0
21 Jun 2019
Learning-Driven Exploration for Reinforcement Learning
Muhammad Usama
D. Chang
67
11
0
17 Jun 2019
Sub-policy Adaptation for Hierarchical Reinforcement Learning
Alexander C. Li
Carlos Florensa
I. Clavera
Pieter Abbeel
96
74
0
13 Jun 2019
Efficient Exploration via State Marginal Matching
Lisa Lee
Benjamin Eysenbach
Emilio Parisotto
Eric Xing
Sergey Levine
Ruslan Salakhutdinov
147
248
0
12 Jun 2019
Fast Task Inference with Variational Intrinsic Successor Features
Steven Hansen
Will Dabney
André Barreto
T. Wiele
David Warde-Farley
Volodymyr Mnih
BDL
100
152
0
12 Jun 2019
Self-Supervised Exploration via Disagreement
Deepak Pathak
Dhiraj Gandhi
Abhinav Gupta
SSL
85
385
0
10 Jun 2019
Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies
M. A. Masood
Finale Doshi-Velez
65
51
0
31 May 2019
Learning Navigation Subroutines from Egocentric Videos
Ashish Kumar
Saurabh Gupta
Jitendra Malik
SSL
EgoV
85
13
0
29 May 2019
Adversarial Imitation Learning from Incomplete Demonstrations
Mingfei Sun
Xiaojuan Ma
78
29
0
29 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
146
122
0
27 May 2019
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies
Xue Bin Peng
Michael Chang
Grace Zhang
Pieter Abbeel
Sergey Levine
85
197
0
23 May 2019
COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration
Nicholas Watters
Loic Matthey
Matko Bosnjak
Christopher P. Burgess
Alexander Lerchner
OffRL
120
118
0
22 May 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Rui Zhao
Xudong Sun
Volker Tresp
67
83
0
21 May 2019
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Shangtong Zhang
Andrzej Wojcicki
Mai Xu
LRM
87
15
0
12 May 2019
Routing Networks and the Challenges of Modular and Compositional Computation
Clemens Rosenbaum
Ignacio Cases
Matthew D Riemer
Tim Klinger
77
84
0
29 Apr 2019
Active Domain Randomization
Bhairav Mehta
Manfred Diaz
Florian Golemo
C. Pal
Liam Paull
90
265
0
09 Apr 2019
Multitask Soft Option Learning
Maximilian Igl
Andrew Gambardella
Jinke He
Nantas Nardelli
N. Siddharth
Wendelin Bohmer
Shimon Whiteson
187
26
0
01 Apr 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
72
44
0
18 Mar 2019
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Vitchyr H. Pong
Murtaza Dalal
Steven Lin
Ashvin Nair
Shikhar Bahl
Sergey Levine
OffRL
SSL
132
277
0
08 Mar 2019
Discovering Options for Exploration by Minimizing Cover Time
Yuu Jinnai
Jee Won Park
David Abel
George Konidaris
78
52
0
02 Mar 2019
The Termination Critic
Anna Harutyunyan
Will Dabney
Diana Borsa
N. Heess
Rémi Munos
Doina Precup
OffRL
55
48
0
26 Feb 2019
Preferences Implicit in the State of the World
Rohin Shah
Dmitrii Krasheninnikov
Jordan Alexander
Pieter Abbeel
Anca Dragan
82
55
0
12 Feb 2019
CLIC: Curriculum Learning and Imitation for object Control in non-rewarding environments
Pierre Fournier
Olivier Sigaud
Cédric Colas
Mohamed Chetouani
OffRL
87
26
0
28 Jan 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution
G. Lee
Chang Ouk Kim
33
4
0
17 Jan 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions
Rui Wang
Joel Lehman
Jeff Clune
Kenneth O. Stanley
125
250
0
07 Jan 2019
Modulated Policy Hierarchies
Alexander Pashevich
Danijar Hafner
James Davidson
Rahul Sukthankar
Cordelia Schmid
46
6
0
30 Nov 2018
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL
OffRL
SSL
101
178
0
28 Nov 2018
Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning
Sainbayar Sukhbaatar
Emily L. Denton
Arthur Szlam
Rob Fergus
SSL
116
43
0
22 Nov 2018
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
126
398
0
15 Nov 2018
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Mai Xu
69
18
0
10 Nov 2018
Contingency-Aware Exploration in Reinforcement Learning
Jongwook Choi
Yijie Guo
Marcin Moczulski
Junhyuk Oh
Neal Wu
Mohammad Norouzi
Honglak Lee
80
73
0
05 Nov 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
163
1,347
0
30 Oct 2018
Model-Based Active Exploration
Pranav Shyam
Wojciech Ja'skowski
Faustino J. Gomez
101
179
0
29 Oct 2018
Inverse reinforcement learning for video games
Aaron David Tucker
Adam Gleave
Stuart J. Russell
72
48
0
24 Oct 2018
Finding Options that Minimize Planning Time
Yuu Jinnai
David Abel
D Ellis Hershkowitz
Michael Littman
George Konidaris
75
42
0
16 Oct 2018
Policy Transfer with Strategy Optimization
Wenhao Yu
Chenxi Liu
Greg Turk
98
81
0
12 Oct 2018
Previous
1
2
3
4
5
6
7
8
9
Next