Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.00814
Cited By
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
3 July 2015
Bradly C. Stadie
Sergey Levine
Pieter Abbeel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models"
15 / 115 papers shown
Title
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
78
2,401
0
15 May 2017
On Improving Deep Reinforcement Learning for POMDPs
Pengfei Zhu
Xin Li
Pascal Poupart
Guanghui Miao
29
123
0
26 Apr 2017
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
40
235
0
06 Mar 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
109
1,505
0
25 Jan 2017
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
60
760
0
15 Nov 2016
Multi-Objective Deep Reinforcement Learning
Hossam Mossalam
Yannis Assael
D. Roijers
Shimon Whiteson
35
151
0
09 Oct 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
40
6
0
17 Aug 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
55
1,459
0
06 Jun 2016
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Xiaoxiao Guo
Satinder Singh
Richard L. Lewis
Honglak Lee
30
55
0
24 Apr 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
25
1,127
0
20 Apr 2016
Dialog-based Language Learning
Jason Weston
LLMAG
24
108
0
20 Apr 2016
Exploratory Gradient Boosting for Reinforcement Learning in Complex Domains
David Abel
Alekh Agarwal
Fernando Diaz
A. Krishnamurthy
Robert Schapire
OffRL
80
43
0
14 Mar 2016
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
21
147
0
08 Feb 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
29
3,733
0
20 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
132
3,766
0
18 Nov 2015
Previous
1
2
3