1602.04621
Cited By
Deep Exploration via Bootstrapped DQN
15 February 2016
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
Papers citing
"Deep Exploration via Bootstrapped DQN"
50 / 288 papers shown
Title
Counter-Inferential Behavior in Natural and Artificial Cognitive Systems
Serge Dolgikh
22
0
0
19 May 2025
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
14
0
0
16 May 2025
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Anthony GX-Chen
Dongyan Lin
Mandana Samiei
Doina Precup
Blake A. Richards
Rob Fergus
Kenneth Marino
CML
LRM
34
0
0
14 May 2025
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Miguel Arana-Catania
Weisi Guo
CML
35
0
0
13 May 2025
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
94
1
0
29 Apr 2025
LaMOuR: Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning
Chan Kim
Seung-Woo Seo
Seong-Woo Kim
OODD
259
0
0
21 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
245
0
0
14 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
95
0
0
27 Feb 2025
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
126
1
0
24 Feb 2025
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
49
0
0
23 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
241
0
0
17 Feb 2025
DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation
James R. Han
Hugues Thomas
Jian Zhang
Nicholas Rhinehart
T. Barfoot
69
1
0
17 Feb 2025
Learning from Active Human Involvement through Proxy Value Propagation
Zhenghao Peng
Wenjie Mo
Chenda Duan
Quanyi Li
Bolei Zhou
109
14
0
05 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
94
14
0
28 Jan 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
49
0
0
25 Jan 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
81
0
0
17 Jan 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feifei Feng
OffRL
109
1
0
22 Dec 2024
Improved Regret of Linear Ensemble Sampling
Harin Lee
Min-hwan Oh
42
1
0
06 Nov 2024
vMF-Contact: Uncertainty-aware Evidential Learning for Probabilistic Contact-grasp in Noisy Clutter
Yitian Shi
Edgar Welte
Maximilian Gilles
Rania Rayyes
43
3
0
06 Nov 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
119
2
0
23 Oct 2024
Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning
Wen-Han Hsieh
Jen-Yuan Chang
26
0
0
26 Aug 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
38
3
0
18 Jul 2024
Learning to Steer Markovian Agents under Model Uncertainty
Jiawei Huang
Vinzenz Thoma
Zebang Shen
H. Nax
Niao He
48
2
0
14 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
17
0
05 Jul 2024
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
Ziyi Huang
Henry Lam
Haofeng Zhang
35
0
0
20 Jun 2024
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Jiesong Lian
Yucong Huang
Chengdong Ma
Mingzhi Wang
Ying Wen
Long Hu
Yixue Hao
65
0
0
31 May 2024
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang
Xudong Yu
Chenjia Bai
Qiaosheng Zhang
Zhen Wang
42
1
0
12 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
52
1
0
07 May 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
34
0
0
19 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
45
0
0
02 Apr 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
34
2
0
14 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
33
5
0
05 Feb 2024
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Carlo D'Eramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
59
125
0
17 Jan 2024
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
Thanh Nguyen-Tang
Raman Arora
OffRL
38
3
0
06 Jan 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
28
0
0
24 Dec 2023
Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning
Yaoquan Wei
Shunyu Liu
Mingli Song
Tongya Zheng
Kaixuan Chen
Yong Wang
Mingli Song
27
0
0
28 Nov 2023
EduGym: An Environment and Notebook Suite for Reinforcement Learning Education
Thomas M. Moerland
Matthias Muller-Brockhausen
Zhao Yang
Andrius Bernatavicius
Koen Ponse
Tom Kouwenhoven
Andreas Sauter
Michiel van der Meer
Bram M. Renting
Aske Plaat
OffRL
40
0
0
17 Nov 2023
Ensemble sampling for linear bandits: small ensembles suffice
David Janz
A. Litvak
Csaba Szepesvári
38
1
0
14 Nov 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
40
1
0
30 Oct 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
43
7
0
30 Oct 2023
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
Yukun Zhao
Lingyong Yan
Weiwei Sun
Guoliang Xing
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
Dawei Yin
36
37
0
27 Oct 2023
Posterior Sampling-based Online Learning for Episodic POMDPs
Dengwang Tang
Dongze Ye
Rahul Jain
A. Nayyar
Pierluigi Nuzzo
OffRL
53
0
0
16 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
30
6
0
09 Oct 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
Woojun Kim
Jeonghye Kim
Young-Jin Sung
28
5
0
05 Oct 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
35
5
0
22 Aug 2023
Bag of Policies for Distributional Deep Exploration
Asen Nachkov
Luchen Li
Giulia Luise
Filippo Valdettaro
Aldo A. Faisal
OffRL
43
0
0
03 Aug 2023
Diverse Projection Ensembles for Distributional Reinforcement Learning
Moritz A. Zanger
Wendelin Bohmer
M. Spaan
38
4
0
12 Jun 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
30
20
0
29 May 2023
State Representation Learning Using an Unbalanced Atlas
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
37
2
0
17 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
36
8
0
05 May 2023