Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.04621
Cited By
Deep Exploration via Bootstrapped DQN
15 February 2016
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Exploration via Bootstrapped DQN"
50 / 288 papers shown
Title
Generative Adversarial Exploration for Reinforcement Learning
Weijun Hong
Menghui Zhu
Minghuan Liu
Weinan Zhang
Ming Zhou
Yong Yu
Peng Sun
OnRL
39
7
0
27 Jan 2022
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
40
34
0
05 Jan 2022
Generalisation effects of predictive uncertainty estimation in deep learning for digital pathology
Milda Pocevičiūtė
Gabriel Eilertsen
Sofia Jarkman
Claes Lundström
OOD
UQCV
36
24
0
17 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
27
30
0
16 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Enhanced Exploration in Neural Feature Selection for Deep Click-Through Rate Prediction Models via Ensemble of Gating Layers
L. Guan
Xia Xiao
Ming-yue Chen
Youlong Cheng
27
1
0
07 Dec 2021
Learning State Representations via Retracing in Reinforcement Learning
Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
32
7
0
24 Nov 2021
A General Divergence Modeling Strategy for Salient Object Detection
Xinyu Tian
Jing Zhang
Yuchao Dai
35
0
0
23 Nov 2021
Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval
Dingrong Wang
Hitesh Sapkota
Xumin Liu
Qi Yu
41
4
0
21 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
21
21
0
09 Nov 2021
The Value of Information When Deciding What to Learn
Dilip Arumugam
Benjamin Van Roy
37
12
0
26 Oct 2021
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
39
9
0
24 Oct 2021
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Jordan T. Ash
Cyril Zhang
Surbhi Goel
A. Krishnamurthy
Sham Kakade
45
6
0
21 Oct 2021
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
54
20
0
15 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
13
105
0
05 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
105
265
0
04 Oct 2021
Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning
Elie Aljalbout
Maximilian Ulmer
Rudolph Triebel
24
2
0
02 Oct 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
46
8
0
29 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
64
55
0
28 Sep 2021
Making Curiosity Explicit in Vision-based RL
Elie Aljalbout
Maximilian Ulmer
Rudolph Triebel
OffRL
34
2
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
238
89
0
27 Sep 2021
Introspective Robot Perception using Smoothed Predictions from Bayesian Neural Networks
Jianxiang Feng
M. Durner
Zoltán-Csaba Márton
Ferenc Bálint-Benczédi
Rudolph Triebel
UQCV
BDL
13
11
0
27 Sep 2021
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
34
11
0
26 Sep 2021
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
Qiang He
Yuxun Qu
Chen Gong
Xinwen Hou
OffRL
22
10
0
22 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
41
93
0
14 Sep 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
38
1
0
08 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
37
80
0
01 Sep 2021
Uncertainty-Aware Model Adaptation for Unsupervised Cross-Domain Object Detection
Minjie Cai
Minyi Luo
Xionghu Zhong
Hao Chen
OOD
29
6
0
28 Aug 2021
NPBDREG: Uncertainty Assessment in Diffeomorphic Brain MRI Registration using a Non-parametric Bayesian Deep-Learning Based Approach
Samah Khawaled
Moti Freiman
UQCV
39
8
0
15 Aug 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Iou-Jen Liu
Unnat Jain
Raymond A. Yeh
Alex Schwing
42
104
0
23 Jul 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
283
109
0
13 Jul 2021
Offline reinforcement learning with uncertainty for treatment strategies in sepsis
Ran Liu
J. Greenstein
J. Fackler
Jules Bergmann
M. Bembea
R. Winslow
OffRL
14
7
0
09 Jul 2021
FedAdapt: Adaptive Offloading for IoT Devices in Federated Learning
Di Wu
R. Ullah
P. Harvey
Peter Kilpatrick
I. Spence
Blesson Varghese
42
79
0
09 Jul 2021
Learning from Demonstration without Demonstrations
Tom Blau
Gilad Francis
Philippe Morere
OffRL
24
1
0
17 Jun 2021
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
Amrit Singh Bedi
Anjaly Parayil
Junyu Zhang
Mengdi Wang
Alec Koppel
38
15
0
15 Jun 2021
Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Haque Ishfaq
Qiwen Cui
V. Nguyen
Alex Ayoub
Zhuoran Yang
Zhaoran Wang
Doina Precup
Lin F. Yang
37
43
0
15 Jun 2021
Quantifying Uncertainty in Deep Spatiotemporal Forecasting
Dongxian Wu
Liyao (Mars) Gao
X. Xiong
Matteo Chinazzi
Alessandro Vespignani
Yi Ma
Rose Yu
AI4TS
16
68
0
25 May 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDL
OffRL
OnRL
21
184
0
17 May 2021
Principled Exploration via Optimistic Bootstrapping and Backward Induction
Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
26
38
0
13 May 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
54
0
11 May 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning
Sebastian Curi
Ilija Bogunovic
Andreas Krause
39
17
0
18 Mar 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
24
17
0
10 Mar 2021
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
30
70
0
06 Mar 2021
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Lili Chen
Kimin Lee
A. Srinivas
Pieter Abbeel
OffRL
24
11
0
04 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
Steven Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
43
25
0
24 Feb 2021
DEUP: Direct Epistemic Uncertainty Prediction
Salem Lahlou
Moksh Jain
Hadi Nekoei
V. Butoi
Paul Bertin
Jarrid Rector-Brooks
Maksym Korablyov
Yoshua Bengio
PER
UQLM
UQCV
UD
212
81
0
16 Feb 2021
Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning
Jaskirat Singh
Liang Zheng
OffRL
21
3
0
14 Feb 2021
Online Apprenticeship Learning
Lior Shani
Tom Zahavy
Shie Mannor
OffRL
29
25
0
13 Feb 2021
When and How Mixup Improves Calibration
Linjun Zhang
Zhun Deng
Kenji Kawaguchi
James Zou
UQCV
36
67
0
11 Feb 2021
Previous
1
2
3
4
5
6
Next