Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.05363
Cited By
Curiosity-driven Exploration by Self-supervised Prediction
15 May 2017
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Curiosity-driven Exploration by Self-supervised Prediction"
50 / 1,353 papers shown
Title
Learning about an exponential amount of conditional distributions
Mohamed Ishmael Belghazi
Maxime Oquab
Yann LeCun
David Lopez-Paz
BDL
SSL
68
28
0
22 Feb 2019
World Discovery Models
M. G. Azar
Bilal Piot
Bernardo Avila-Pires
Jean-Bastien Grill
Florent Altché
Rémi Munos
126
26
0
20 Feb 2019
Curiosity-Driven Experience Prioritization via Density Estimation
Rui Zhao
Volker Tresp
136
55
0
20 Feb 2019
Sufficiently Accurate Model Learning
Clark Zhang
Arbaaz Khan
Santiago Paternain
Alejandro Ribeiro
33
3
0
19 Feb 2019
Distilling Policy Distillation
Wojciech M. Czarnecki
Razvan Pascanu
Simon Osindero
Siddhant M. Jayakumar
G. Swirszcz
Max Jaderberg
85
134
0
06 Feb 2019
Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning
Arthur Juliani
Ahmed Khalifa
Vincent-Pierre Berges
Jonathan Harper
Ervin Teng
Hunter Henry
A. Crespi
Julian Togelius
Danny Lange
87
144
0
04 Feb 2019
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Francisco M. Garcia
Philip S. Thomas
104
41
0
03 Feb 2019
Competitive Experience Replay
Hao Liu
Alexander R. Trott
R. Socher
Caiming Xiong
OffRL
131
53
0
01 Feb 2019
Learning Action Representations for Reinforcement Learning
Yash Chandak
Georgios Theocharous
James E. Kostas
Scott M. Jordan
Philip S. Thomas
75
165
0
01 Feb 2019
InfoBot: Transfer and Exploration via the Information Bottleneck
Anirudh Goyal
Riashat Islam
Daniel Strouse
Zafarali Ahmed
M. Botvinick
Hugo Larochelle
Yoshua Bengio
Sergey Levine
OffRL
133
167
0
30 Jan 2019
Trust Region-Guided Proximal Policy Optimization
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
OffRL
89
57
0
29 Jan 2019
Provably efficient RL with Rich Observations via Latent State Decoding
S. Du
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
Miroslav Dudík
John Langford
OffRL
78
230
0
25 Jan 2019
Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics
Antonin Raffin
Ashley Hill
Kalifou René Traoré
Timothée Lesort
Natalia Díaz Rodríguez
David Filliat
SSL
OffRL
52
56
0
24 Jan 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Tom Zahavy
Shie Mannor
HAI
113
30
0
24 Jan 2019
Never Forget: Balancing Exploration and Exploitation via Learning Optical Flow
Hsuan-Kung Yang
Po-Han Chiang
Kuan-Wei Ho
Min-Fong Hong
Chun-Yi Lee
45
7
0
24 Jan 2019
Open-ended Learning in Symmetric Zero-sum Games
David Balduzzi
M. Garnelo
Yoram Bachrach
Wojciech M. Czarnecki
Julien Perolat
Max Jaderberg
T. Graepel
92
174
0
23 Jan 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution
G. Lee
Chang Ouk Kim
33
4
0
17 Jan 2019
An investigation of model-free planning
A. Guez
M. Berk Mirza
Karol Gregor
Rishabh Kabra
S. Racanière
...
Laurent Orseau
Tom Eccles
Greg Wayne
David Silver
Timothy Lillicrap
OffRL
106
117
0
11 Jan 2019
Exploring applications of deep reinforcement learning for real-world autonomous driving systems
V. Talpaert
Ibrahim Sobh
Ravi Kiran
Patrick Mannion
S. Yogamani
Ahmad El-Sallab
P. Pérez
70
74
0
06 Jan 2019
What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning
Daniel Gordon
Dieter Fox
Ali Farhadi
80
20
0
06 Jan 2019
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies
Alexander Sax
Bradley Emi
Amir Zamir
Leonidas Guibas
Silvio Savarese
Jitendra Malik
SSL
93
16
0
31 Dec 2018
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
Sirui Xie
Junning Huang
Lanxin Lei
Chunxiao Liu
Zheng Ma
Wayne Zhang
Liang Lin
57
8
0
21 Dec 2018
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents
F. Such
Vashisht Madhavan
Rosanne Liu
Rui Wang
Pablo Samuel Castro
...
Jiale Zhi
Ludwig Schubert
Marc G. Bellemare
Jeff Clune
Joel Lehman
OffRL
86
54
0
17 Dec 2018
Malthusian Reinforcement Learning
Joel Z Leibo
Julien Perolat
Edward Hughes
S. Wheelwright
Adam H. Marblestone
Edgar A. Duénez-Guzmán
P. Sunehag
Iain Dunning
T. Graepel
AI4CE
103
38
0
17 Dec 2018
Gold Seeker: Information Gain from Policy Distributions for Goal-oriented Vision-and-Langauge Reasoning
Ehsan Abbasnejad
Iman Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton Van Den Hengel
OffRL
87
5
0
16 Dec 2018
On the potential for open-endedness in neural networks
N. Guttenberg
N. Virgo
A. Penn
64
10
0
12 Dec 2018
Efficient Model-Free Reinforcement Learning Using Gaussian Process
Ying Fan
Letian Chen
Yizhou Wang
GP
62
6
0
11 Dec 2018
Improving Model-Based Control and Active Exploration with Reconstruction Uncertainty Optimization
Norman Di Palo
Harri Valpola
26
3
0
10 Dec 2018
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
132
139
0
08 Dec 2018
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
98
305
0
06 Dec 2018
Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning
Mitchell Wortsman
Kiana Ehsani
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
SSL
105
223
0
03 Dec 2018
Modulated Policy Hierarchies
Alexander Pashevich
Danijar Hafner
James Davidson
Rahul Sukthankar
Cordelia Schmid
46
6
0
30 Nov 2018
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
177
1,279
0
30 Nov 2018
Exploring Restart Distributions
Arash Tavakoli
Vitaly Levdik
Riashat Islam
Christopher M. Smith
Petar Kormushev
OffRL
35
5
0
27 Nov 2018
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Qihao Liu
Yujia Wang
Xiao-Fei Liu
84
8
0
26 Nov 2018
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Xin Eric Wang
Qiuyuan Huang
Asli Celikyilmaz
Jianfeng Gao
Dinghan Shen
Yuan-fang Wang
William Yang Wang
Lei Zhang
LM&Ro
SSL
153
542
0
25 Nov 2018
Stable Opponent Shaping in Differentiable Games
Alistair Letcher
Jakob N. Foerster
David Balduzzi
Tim Rocktaschel
Shimon Whiteson
153
110
0
20 Nov 2018
Model Learning for Look-ahead Exploration in Continuous Control
Arpit Agarwal
Katharina Muelling
Katerina Fragkiadaki
61
8
0
20 Nov 2018
Learning Actionable Representations with Goal-Conditioned Policies
Dibya Ghosh
Abhishek Gupta
Sergey Levine
105
110
0
19 Nov 2018
Policy Optimization with Model-based Explorations
Feiyang Pan
Qingpeng Cai
Anxiang Zeng
C. Pan
Qing Da
Hua-Lin He
Qing He
Pingzhong Tang
86
11
0
18 Nov 2018
Cost-Aware Fine-Grained Recognition for IoTs Based on Sequential Fixations
Hanxiao Wang
Venkatesh Saligrama
Stan Sclaroff
Vitaly Ablavsky
46
0
0
16 Nov 2018
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
134
398
0
15 Nov 2018
Towards Governing Agent's Efficacy: Action-Conditional
β
β
β
-VAE for Deep Transparent Reinforcement Learning
John Yang
Gyujeong Lee
Minsung Hyun
Simyung Chang
Nojun Kwak
65
3
0
11 Nov 2018
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Mai Xu
69
18
0
10 Nov 2018
Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control
Kendall Lowrey
Aravind Rajeswaran
Sham Kakade
G. Haro
Igor Mordatch
OffRL
79
229
0
05 Nov 2018
Contingency-Aware Exploration in Reinforcement Learning
Jongwook Choi
Yijie Guo
Marcin Moczulski
Junhyuk Oh
Neal Wu
Mohammad Norouzi
Honglak Lee
80
73
0
05 Nov 2018
Sequence Generation with Guider Network
Ruiyi Zhang
Changyou Chen
Zhe Gan
Wenlin Wang
Liqun Chen
Dinghan Shen
Guoyin Wang
Lawrence Carin
3DV
46
4
0
02 Nov 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
183
1,347
0
30 Oct 2018
Model-Based Active Exploration
Pranav Shyam
Wojciech Ja'skowski
Faustino J. Gomez
101
179
0
29 Oct 2018
Deep Intrinsically Motivated Continuous Actor-Critic for Efficient Robotic Visuomotor Skill Learning
Muhammad Burhan Hafez
C. Weber
Matthias Kerzel
S. Wermter
54
22
0
26 Oct 2018
Previous
1
2
3
...
24
25
26
27
28
Next