ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.04621
  4. Cited By
Deep Exploration via Bootstrapped DQN

Deep Exploration via Bootstrapped DQN

15 February 2016
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
ArXivPDFHTML

Papers citing "Deep Exploration via Bootstrapped DQN"

38 / 288 papers shown
Title
Towards Safe Autonomous Driving: Capture Uncertainty in the Deep Neural
  Network For Lidar 3D Vehicle Detection
Towards Safe Autonomous Driving: Capture Uncertainty in the Deep Neural Network For Lidar 3D Vehicle Detection
Di Feng
Lars Rosenbaum
Klaus C. J. Dietmayer
3DPC
UQCV
36
243
0
13 Apr 2018
Personalized Dynamics Models for Adaptive Assistive Navigation Systems
Personalized Dynamics Models for Adaptive Assistive Navigation Systems
Eshed Ohn-Bar
Kris Kitani
Chieko Asakawa
27
29
0
11 Apr 2018
Learning to Run challenge solutions: Adapting reinforcement learning
  methods for neuromusculoskeletal environments
Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Zhewei Huang
Shuchang Zhou
...
Sean F. Carroll
Jennifer Hicks
Sergey Levine
M. Salathé
Scott L. Delp
40
88
0
02 Apr 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
93
1,657
0
30 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement
  Learning
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
40
116
0
03 Mar 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep
  Networks for Thompson Sampling
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
41
365
0
26 Feb 2018
Reinforcement Learning on Web Interfaces Using Workflow-Guided
  Exploration
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
E. Liu
Kelvin Guu
Panupong Pasupat
Tianlin Shi
Percy Liang
OnRL
24
207
0
24 Feb 2018
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative
  for Training Deep Neural Networks for Reinforcement Learning
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
686
0
18 Dec 2017
Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy
  Optimisation
Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation
Christopher Tegho
Paweł Budzianowski
Milica Gasic
35
8
0
30 Nov 2017
Variational Deep Q Network
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
38
10
0
30 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement
  Learning
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
Benjamin Eysenbach
S. Gu
Julian Ibarz
Sergey Levine
CLL
37
139
0
18 Nov 2017
Exploration in Feature Space for Reinforcement Learning
Exploration in Feature Space for Reinforcement Learning
S. N. Sasikumar
57
4
0
05 Oct 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
37
544
0
18 Sep 2017
The Uncertainty Bellman Equation and Exploration
The Uncertainty Bellman Equation and Exploration
Brendan O'Donoghue
Ian Osband
Rémi Munos
Volodymyr Mnih
36
186
0
15 Sep 2017
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,780
0
19 Aug 2017
Hindsight Experience Replay
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
107
2,297
0
05 Jul 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
24
122
0
25 Jun 2017
Ensemble Sampling
Ensemble Sampling
Xiuyuan Lu
Benjamin Van Roy
14
118
0
20 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
78
2,401
0
15 May 2017
From Language to Programs: Bridging Reinforcement Learning and Maximum
  Marginal Likelihood
From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood
Kelvin Guu
Panupong Pasupat
E. Liu
Percy Liang
34
190
0
25 Apr 2017
Thompson Sampling for Linear-Quadratic Control Problems
Thompson Sampling for Linear-Quadratic Control Problems
Marc Abeille
A. Lazaric
17
55
0
27 Mar 2017
Deep Exploration via Randomized Value Functions
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
41
300
0
22 Mar 2017
Neural Episodic Control
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRL
BDL
35
345
0
06 Mar 2017
Multiplicative Normalizing Flows for Variational Bayesian Neural
  Networks
Multiplicative Normalizing Flows for Variational Bayesian Neural Networks
Christos Louizos
Max Welling
BDL
47
454
0
06 Mar 2017
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
G. Kahn
Adam R. Villaflor
Vitchyr H. Pong
Pieter Abbeel
Sergey Levine
41
312
0
03 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,505
0
25 Jan 2017
Simple and Scalable Predictive Uncertainty Estimation using Deep
  Ensembles
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
278
5,695
0
05 Dec 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
60
760
0
15 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for
  Robotics
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
27
11
0
01 Nov 2016
Supervision via Competition: Robot Adversaries for Learning Tasks
Supervision via Competition: Robot Adversaries for Learning Tasks
Lerrel Pinto
James Davidson
Abhinav Gupta
SSL
31
82
0
05 Oct 2016
Episodic Exploration for Deep Deterministic Policies: An Application to
  StarCraft Micromanagement Tasks
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks
Nicolas Usunier
Gabriel Synnaeve
Zeming Lin
Soumith Chintala
27
138
0
10 Sep 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for
  Task-Oriented Dialogue Systems
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
40
6
0
17 Aug 2016
Playing Atari Games with Deep Reinforcement Learning and Human
  Checkpoint Replay
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
Ionel-Alexandru Hosu
Traian Rebedea
29
97
0
18 Jul 2016
Concrete Problems in AI Safety
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
83
2,324
0
21 Jun 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal
  Abstraction and Intrinsic Motivation
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
25
1,127
0
20 Apr 2016
Structured and Efficient Variational Deep Learning with Matrix Gaussian
  Posteriors
Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors
Christos Louizos
Max Welling
BDL
33
254
0
15 Mar 2016
Learning values across many orders of magnitude
Learning values across many orders of magnitude
H. V. Hasselt
A. Guez
Matteo Hessel
Volodymyr Mnih
David Silver
22
169
0
24 Feb 2016
Dropout as a Bayesian Approximation: Representing Model Uncertainty in
  Deep Learning
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
287
9,167
0
06 Jun 2015
Previous
123456