Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.04621
Cited By
Deep Exploration via Bootstrapped DQN
15 February 2016
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Exploration via Bootstrapped DQN"
38 / 288 papers shown
Title
Towards Safe Autonomous Driving: Capture Uncertainty in the Deep Neural Network For Lidar 3D Vehicle Detection
Di Feng
Lars Rosenbaum
Klaus C. J. Dietmayer
3DPC
UQCV
36
243
0
13 Apr 2018
Personalized Dynamics Models for Adaptive Assistive Navigation Systems
Eshed Ohn-Bar
Kris Kitani
Chieko Asakawa
27
29
0
11 Apr 2018
Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Zhewei Huang
Shuchang Zhou
...
Sean F. Carroll
Jennifer Hicks
Sergey Levine
M. Salathé
Scott L. Delp
40
88
0
02 Apr 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
93
1,657
0
30 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
40
116
0
03 Mar 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
41
365
0
26 Feb 2018
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
E. Liu
Kelvin Guu
Panupong Pasupat
Tianlin Shi
Percy Liang
OnRL
24
207
0
24 Feb 2018
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
686
0
18 Dec 2017
Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation
Christopher Tegho
Paweł Budzianowski
Milica Gasic
35
8
0
30 Nov 2017
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
38
10
0
30 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
Benjamin Eysenbach
S. Gu
Julian Ibarz
Sergey Levine
CLL
37
139
0
18 Nov 2017
Exploration in Feature Space for Reinforcement Learning
S. N. Sasikumar
57
4
0
05 Oct 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
37
544
0
18 Sep 2017
The Uncertainty Bellman Equation and Exploration
Brendan O'Donoghue
Ian Osband
Rémi Munos
Volodymyr Mnih
36
186
0
15 Sep 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,780
0
19 Aug 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
107
2,297
0
05 Jul 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
24
122
0
25 Jun 2017
Ensemble Sampling
Xiuyuan Lu
Benjamin Van Roy
14
118
0
20 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
78
2,401
0
15 May 2017
From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood
Kelvin Guu
Panupong Pasupat
E. Liu
Percy Liang
34
190
0
25 Apr 2017
Thompson Sampling for Linear-Quadratic Control Problems
Marc Abeille
A. Lazaric
17
55
0
27 Mar 2017
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
41
300
0
22 Mar 2017
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRL
BDL
35
345
0
06 Mar 2017
Multiplicative Normalizing Flows for Variational Bayesian Neural Networks
Christos Louizos
Max Welling
BDL
47
454
0
06 Mar 2017
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
G. Kahn
Adam R. Villaflor
Vitchyr H. Pong
Pieter Abbeel
Sergey Levine
41
312
0
03 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,505
0
25 Jan 2017
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
278
5,695
0
05 Dec 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
60
760
0
15 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
27
11
0
01 Nov 2016
Supervision via Competition: Robot Adversaries for Learning Tasks
Lerrel Pinto
James Davidson
Abhinav Gupta
SSL
31
82
0
05 Oct 2016
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks
Nicolas Usunier
Gabriel Synnaeve
Zeming Lin
Soumith Chintala
27
138
0
10 Sep 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
40
6
0
17 Aug 2016
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
Ionel-Alexandru Hosu
Traian Rebedea
29
97
0
18 Jul 2016
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
83
2,324
0
21 Jun 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
25
1,127
0
20 Apr 2016
Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors
Christos Louizos
Max Welling
BDL
33
254
0
15 Mar 2016
Learning values across many orders of magnitude
H. V. Hasselt
A. Guez
Matteo Hessel
Volodymyr Mnih
David Silver
22
169
0
24 Feb 2016
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
287
9,167
0
06 Jun 2015
Previous
1
2
3
4
5
6