2402.10228
Cited By
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
5 February 2024
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
Papers citing
"Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent"
14 / 14 papers shown
Title
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
65
20
0
29 May 2023
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
Christoph Dann
M. Mohri
Tong Zhang
Julian Zimmert
OffRL
33
34
0
23 Aug 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
52
52
0
19 Jun 2022
An Analysis of Ensemble Sampling
Chao Qin
Zheng Wen
Xiuyuan Lu
Benjamin Van Roy
59
22
0
02 Mar 2022
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
90
231
0
30 Oct 2021
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
76
119
0
31 Aug 2021
Bilinear Classes: A Structural Framework for Provable Generalization in RL
S. Du
Sham Kakade
Jason D. Lee
Shachar Lovett
G. Mahajan
Wen Sun
Ruosong Wang
OffRL
111
189
0
19 Mar 2021
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
40
70
0
06 Mar 2021
Efficiently Sampling Functions from Gaussian Process Posteriors
James T. Wilson
Viacheslav Borovitskiy
Alexander Terenin
P. Mostowsky
M. Deisenroth
36
163
0
21 Feb 2020
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
77
302
0
22 Mar 2017
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Ian Osband
Benjamin Van Roy
BDL
74
257
0
01 Jul 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
162
1,465
0
06 Jun 2016
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
136
7,590
0
22 Sep 2015
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
85
2,992
0
19 Jul 2012