Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent

5 February 2024

Papers citing "Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent"

| Title | Authors | Tags | | | | Date |
|---|---|---|---|---|---|---|
| Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo | Haque Ishfaq, Qingfeng Lan, Pan Xu, A. R. Mahmood, Doina Precup, Anima Anandkumar, Kamyar Azizzadenesheli | BDL, OffRL | 65 | 20 | 0 | 29 May 2023 |
| A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning | Christoph Dann, M. Mohri, Tong Zhang, Julian Zimmert | OffRL | 33 | 34 | 0 | 23 Aug 2022 |
| Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation | Christoph Dann, Yishay Mansour, M. Mohri, Ayush Sekhari, Karthik Sridharan | | 52 | 52 | 0 | 19 Jun 2022 |
| An Analysis of Ensemble Sampling | Chao Qin, Zheng Wen, Xiuyuan Lu, Benjamin Van Roy | | 59 | 22 | 0 | 02 Mar 2022 |
| Mastering Atari Games with Limited Data | Weirui Ye, Shao-Wei Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao | VLM | 90 | 231 | 0 | 30 Oct 2021 |
| APS: Active Pretraining with Successor Features | Hao Liu, Pieter Abbeel | | 76 | 119 | 0 | 31 Aug 2021 |
| Bilinear Classes: A Structural Framework for Provable Generalization in RL | S. Du, Sham Kakade, Jason D. Lee, Shachar Lovett, G. Mahajan, Wen Sun, Ruosong Wang | OffRL | 111 | 189 | 0 | 19 Mar 2021 |
| Reinforcement Learning, Bit by Bit | Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, M. Ibrahimi, Ian Osband, Zheng Wen | | 40 | 70 | 0 | 06 Mar 2021 |
| Efficiently Sampling Functions from Gaussian Process Posteriors | James T. Wilson, Viacheslav Borovitskiy, Alexander Terenin, P. Mostowsky, M. Deisenroth | | 36 | 163 | 0 | 21 Feb 2020 |
| Deep Exploration via Randomized Value Functions | Ian Osband, Benjamin Van Roy, Daniel Russo, Zheng Wen | | 77 | 302 | 0 | 22 Mar 2017 |
| Why is Posterior Sampling Better than Optimism for Reinforcement Learning? | Ian Osband, Benjamin Van Roy | BDL | 74 | 257 | 0 | 01 Jul 2016 |
| Unifying Count-Based Exploration and Intrinsic Motivation | Marc G. Bellemare, S. Srinivasan, Georg Ostrovski, Tom Schaul, D. Saxton, Rémi Munos | | 162 | 1,465 | 0 | 06 Jun 2016 |
| Deep Reinforcement Learning with Double Q-learning | H. V. Hasselt, A. Guez, David Silver | OffRL | 136 | 7,590 | 0 | 22 Sep 2015 |
| The Arcade Learning Environment: An Evaluation Platform for General Agents | Marc G. Bellemare, Yavar Naddaf, J. Veness, Michael Bowling | | 85 | 2,992 | 0 | 19 Jul 2012 |