ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.04621
  4. Cited By
Deep Exploration via Bootstrapped DQN

Deep Exploration via Bootstrapped DQN

15 February 2016
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
ArXivPDFHTML

Papers citing "Deep Exploration via Bootstrapped DQN"

50 / 288 papers shown
Title
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
16
520
0
04 Feb 2021
U-LanD: Uncertainty-Driven Video Landmark Detection
U-LanD: Uncertainty-Driven Video Landmark Detection
Mohammad Jafari
C. Luong
Michael Y. Tsang
A. Gu
N. V. Woudenberg
R. Rohling
T. Tsang
Purang Abolmaesumi
44
12
0
02 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient
  Reinforcement Learning
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
Geometric Entropic Exploration
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
38
30
0
06 Jan 2021
BeBold: Exploration Beyond the Boundary of Explored Regions
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
36
40
0
15 Dec 2020
TAMPC: A Controller for Escaping Traps in Novel Environments
TAMPC: A Controller for Escaping Traps in Novel Environments
Sheng Zhong
Zhenyuan Zhang
Nima Fazeli
Dmitry Berenson
31
7
0
23 Oct 2020
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value
  Iteration
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration
Priyank Agrawal
Jinglin Chen
Nan Jiang
30
18
0
23 Oct 2020
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced
  Reinforcement Learning
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning
Shen Ren
Qianxiao Li
Liye Zhang
Zheng Qin
Bo Yang
26
0
0
22 Oct 2020
Training independent subnetworks for robust prediction
Training independent subnetworks for robust prediction
Marton Havasi
Rodolphe Jenatton
Stanislav Fort
Jeremiah Zhe Liu
Jasper Snoek
Balaji Lakshminarayanan
Andrew M. Dai
Dustin Tran
UQCV
OOD
41
208
0
13 Oct 2020
The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC
  Placement Based on Availability and Energy Consumption
The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC Placement Based on Availability and Energy Consumption
Guto Leoni Santos
Theo Lynn
J. Kelner
P. Endo
21
0
0
12 Oct 2020
Online Safety Assurance for Deep Reinforcement Learning
Online Safety Assurance for Deep Reinforcement Learning
Noga H. Rotman
Michael Schapira
Aviv Tamar
OffRL
38
5
0
07 Oct 2020
Randomized Value Functions via Posterior State-Abstraction Sampling
Randomized Value Functions via Posterior State-Abstraction Sampling
Dilip Arumugam
Benjamin Van Roy
OffRL
33
7
0
05 Oct 2020
Neural Thompson Sampling
Neural Thompson Sampling
Weitong Zhang
Dongruo Zhou
Lihong Li
Quanquan Gu
34
115
0
02 Oct 2020
Novelty Search in Representational Space for Sample Efficient
  Exploration
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
35
43
0
28 Sep 2020
Revisiting Design Choices in Proximal Policy Optimization
Revisiting Design Choices in Proximal Policy Optimization
Chloe Ching-Yun Hsu
Celestine Mendler-Dünner
Moritz Hardt
25
53
0
23 Sep 2020
Provably Efficient Reward-Agnostic Navigation with Linear Value
  Iteration
Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration
Andrea Zanette
A. Lazaric
Mykel J. Kochenderfer
Emma Brunskill
36
64
0
18 Aug 2020
Survey of XAI in digital pathology
Survey of XAI in digital pathology
Milda Pocevičiūtė
Gabriel Eilertsen
Claes Lundström
14
56
0
14 Aug 2020
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Dalin Guo
S. Ktena
Ferenc Huszár
Pranay K. Myana
Wenzhe Shi
Alykhan Tejani
OffRL
38
40
0
03 Aug 2020
UAV Target Tracking in Urban Environments Using Deep Reinforcement
  Learning
UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning
Sarthak Bhagat
Sujit PB
42
47
0
21 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep
  Reinforcement Learning
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
25
199
0
09 Jul 2020
Selective Dyna-style Planning Under Limited Model Capacity
Selective Dyna-style Planning Under Limited Model Capacity
Zaheer Abbas
Samuel Sokota
Erin J. Talvitie
Martha White
37
32
0
05 Jul 2020
A Unifying Framework for Reinforcement Learning and Planning
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
36
9
0
26 Jun 2020
Bayesian Neural Networks: An Introduction and Survey
Bayesian Neural Networks: An Introduction and Survey
Ethan Goan
Clinton Fookes
BDL
UQCV
37
199
0
22 Jun 2020
Reinforcement Learning with Uncertainty Estimation for Tactical
  Decision-Making in Intersections
Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
C. Hoel
Tommy Tram
J. Sjöberg
32
30
0
17 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
33
82
0
15 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
18
18
0
14 Jun 2020
Temporally-Extended ε-Greedy Exploration
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
22
34
0
02 Jun 2020
A Smooth Representation of Belief over SO(3) for Deep Rotation Learning
  with Uncertainty
A Smooth Representation of Belief over SO(3) for Deep Rotation Learning with Uncertainty
Valentin Peretroukhin
Matthew Giamou
David M. Rosen
W. N. Greene
Nicholas Roy
Jonathan Kelly
24
42
0
01 Jun 2020
Efficient Ensemble Model Generation for Uncertainty Estimation with
  Bayesian Approximation in Segmentation
Efficient Ensemble Model Generation for Uncertainty Estimation with Bayesian Approximation in Segmentation
Hong Joo Lee
S. T. Kim
Hakmin Lee
Nassir Navab
Yong Man Ro
UQCV
18
7
0
21 May 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
24
13
0
21 May 2020
Planning to Explore via Self-Supervised World Models
Planning to Explore via Self-Supervised World Models
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
SSL
33
399
0
12 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
32
57
0
12 May 2020
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Baiming Chen
Mengdi Xu
Liang-Sheng Li
Ding Zhao
OffRL
42
63
0
11 May 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through
  Informed Policy Regularization
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
14
19
0
06 May 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
29
38
0
22 Apr 2020
Agent57: Outperforming the Atari Human Benchmark
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
29
510
0
30 Mar 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
121
0
24 Mar 2020
Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
67
773
0
19 Mar 2020
Uncertainty Estimation Using a Single Deep Deterministic Neural Network
Uncertainty Estimation Using a Single Deep Deterministic Neural Network
Joost R. van Amersfoort
Lewis Smith
Yee Whye Teh
Y. Gal
UQCV
BDL
14
55
0
04 Mar 2020
Uncertainty Quantification for Sparse Deep Learning
Uncertainty Quantification for Sparse Deep Learning
Yuexi Wang
Veronika Rockova
BDL
UQCV
36
31
0
26 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
22
158
0
03 Feb 2020
Making Sense of Reinforcement Learning and Probabilistic Inference
Making Sense of Reinforcement Learning and Probabilistic Inference
Brendan O'Donoghue
Ian Osband
Catalin Ionescu
OffRL
27
48
0
03 Jan 2020
Uncertainty-Based Out-of-Distribution Classification in Deep
  Reinforcement Learning
Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning
Andreas Sedlmeier
Thomas Gabor
Thomy Phan
Lenz Belzner
Claudia Linnhoff-Popien
21
25
0
31 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Parting with Illusions about Deep Active Learning
Parting with Illusions about Deep Active Learning
Sudhanshu Mittal
Maxim Tatarchenko
Özgün Çiçek
Thomas Brox
VLM
29
59
0
11 Dec 2019
Optimism in Reinforcement Learning with Generalized Linear Function
  Approximation
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
137
135
0
09 Dec 2019
Adaptive Online Planning for Continual Lifelong Learning
Adaptive Online Planning for Continual Lifelong Learning
Kevin Lu
Igor Mordatch
Pieter Abbeel
OffRL
OnRL
CLL
11
15
0
03 Dec 2019
Implicit Generative Modeling for Efficient Exploration
Implicit Generative Modeling for Efficient Exploration
Neale Ratzlaff
Qinxun Bai
Fuxin Li
Wenyuan Xu
27
12
0
19 Nov 2019
Multi-Path Policy Optimization
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
18
2
0
11 Nov 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
22
31
0
01 Nov 2019
Previous
123456
Next