Deep Exploration via Bootstrapped DQN

15 February 2016

Alexander Pritzel

Papers citing "Deep Exploration via Bootstrapped DQN"

38 / 288 papers shown

Title
Towards Safe Autonomous Driving: Capture Uncertainty in the Deep Neural Network For Lidar 3D Vehicle Detection Di Feng Lars Rosenbaum Klaus C. J. Dietmayer 3DPC UQCV 36 243 0 13 Apr 2018
Personalized Dynamics Models for Adaptive Assistive Navigation Systems Eshed Ohn-Bar Kris Kitani Chieko Asakawa 27 29 0 11 Apr 2018
Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments L. Kidzinski Sharada Mohanty Carmichael F. Ong Zhewei Huang Shuchang Zhou ... Sean F. Carroll Jennifer Hicks Sergey Levine M. Salathé Scott L. Delp 40 88 0 02 Apr 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning Tabish Rashid Mikayel Samvelyan Christian Schroeder de Witt Gregory Farquhar Jakob N. Foerster Shimon Whiteson 93 1,657 0 30 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning Bradly C. Stadie Ge Yang Rein Houthooft Xi Chen Yan Duan Yuhuai Wu Pieter Abbeel Ilya Sutskever LRM 40 116 0 03 Mar 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling C. Riquelme George Tucker Jasper Snoek BDL 41 365 0 26 Feb 2018
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration E. Liu Kelvin Guu Panupong Pasupat Tianlin Shi Percy Liang OnRL 24 207 0 24 Feb 2018
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning F. Such Vashisht Madhavan Edoardo Conti Joel Lehman Kenneth O. Stanley Jeff Clune 47 686 0 18 Dec 2017
Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation Christopher Tegho Paweł Budzianowski Milica Gasic 35 8 0 30 Nov 2017
Variational Deep Q Network Yunhao Tang A. Kucukelbir BDL 38 10 0 30 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning Benjamin Eysenbach S. Gu Julian Ibarz Sergey Levine CLL 37 139 0 18 Nov 2017
Exploration in Feature Space for Reinforcement Learning S. N. Sasikumar 57 4 0 05 Oct 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents Marlos C. Machado Marc G. Bellemare Erik Talvitie J. Veness Matthew J. Hausknecht Michael Bowling 37 544 0 18 Sep 2017
The Uncertainty Bellman Equation and Exploration Brendan O'Donoghue Ian Osband Rémi Munos Volodymyr Mnih 36 186 0 15 Sep 2017
A Brief Survey of Deep Reinforcement Learning Kai Arulkumaran M. Deisenroth Miles Brundage Anil Anthony Bharath OffRL 65 2,780 0 19 Aug 2017
Hindsight Experience Replay Marcin Andrychowicz Dwight Crow Alex Ray Jonas Schneider Rachel Fong Peter Welinder Bob McGrew Joshua Tobin Pieter Abbeel Wojciech Zaremba OffRL 107 2,297 0 05 Jul 2017
Count-Based Exploration in Feature Space for Reinforcement Learning Jarryd Martin S. N. Sasikumar Tom Everitt Marcus Hutter 24 122 0 25 Jun 2017
Ensemble Sampling Xiuyuan Lu Benjamin Van Roy 14 118 0 20 May 2017
Curiosity-driven Exploration by Self-supervised Prediction Deepak Pathak Pulkit Agrawal Alexei A. Efros Trevor Darrell LRM SSL 78 2,401 0 15 May 2017
From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood Kelvin Guu Panupong Pasupat E. Liu Percy Liang 34 190 0 25 Apr 2017
Thompson Sampling for Linear-Quadratic Control Problems Marc Abeille A. Lazaric 17 55 0 27 Mar 2017
Deep Exploration via Randomized Value Functions Ian Osband Benjamin Van Roy Daniel Russo Zheng Wen 41 300 0 22 Mar 2017
Neural Episodic Control Alexander Pritzel Benigno Uria Sriram Srinivasan A. Badia Oriol Vinyals Demis Hassabis Daan Wierstra Charles Blundell OffRL BDL 35 345 0 06 Mar 2017
Multiplicative Normalizing Flows for Variational Bayesian Neural Networks Christos Louizos Max Welling BDL 47 454 0 06 Mar 2017
Uncertainty-Aware Reinforcement Learning for Collision Avoidance G. Kahn Adam R. Villaflor Vitchyr H. Pong Pieter Abbeel Sergey Levine 41 312 0 03 Feb 2017
Deep Reinforcement Learning: An Overview Yuxi Li OffRL VLM 104 1,505 0 25 Jan 2017
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles Balaji Lakshminarayanan Alexander Pritzel Charles Blundell UQCV BDL 278 5,695 0 05 Dec 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning Haoran Tang Rein Houthooft Davis Foote Adam Stooke Xi Chen Yan Duan John Schulman F. Turck Pieter Abbeel OffRL 60 760 0 15 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics J. M. Wong 27 11 0 01 Nov 2016
Supervision via Competition: Robot Adversaries for Learning Tasks Lerrel Pinto James Davidson Abhinav Gupta SSL 31 82 0 05 Oct 2016
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks Nicolas Usunier Gabriel Synnaeve Zeming Lin Soumith Chintala 27 138 0 10 Sep 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Zachary Chase Lipton Xiujun Li Jianfeng Gao Lihong Li Faisal Ahmed Li Deng 40 6 0 17 Aug 2016
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay Ionel-Alexandru Hosu Traian Rebedea 29 97 0 18 Jul 2016
Concrete Problems in AI Safety Dario Amodei C. Olah Jacob Steinhardt Paul Christiano John Schulman Dandelion Mané 83 2,324 0 21 Jun 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation Tejas D. Kulkarni Karthik Narasimhan A. Saeedi J. Tenenbaum 25 1,127 0 20 Apr 2016
Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors Christos Louizos Max Welling BDL 33 254 0 15 Mar 2016
Learning values across many orders of magnitude H. V. Hasselt A. Guez Matteo Hessel Volodymyr Mnih David Silver 22 169 0 24 Feb 2016
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning Y. Gal Zoubin Ghahramani UQCV BDL 287 9,167 0 06 Jun 2015