Beyond Optimism: Exploration With Partially Observable Rewards

Beyond Optimism: Exploration With Partially Observable Rewards

20 June 2024

Alireza Kazemipour

Michael Bowling

Papers citing "Beyond Optimism: Exploration With Partially Observable Rewards"

15 / 15 papers shown

Title
Optimistic Active Exploration of Dynamical Systems Bhavya Sukhija Lenart Treven Cansu Sancaktar Sebastian Blaes Stelian Coros Andreas Krause 72 18 0 21 Jun 2023
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning Sam Lobel Akhil Bagaria George Konidaris 49 16 0 05 Jun 2023
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey A. Aubret L. Matignon S. Hassas 71 35 0 19 Sep 2022
Exploration in Deep Reinforcement Learning: A Survey Pawel Ladosz Lilian Weng Minwoo Kim H. Oh OffRL 45 334 0 02 May 2022
Active Learning for Nonlinear System Identification with Guarantees Horia Mania Michael I. Jordan Benjamin Recht 63 102 0 18 Jun 2020
Information Directed Sampling for Linear Partial Monitoring Johannes Kirschner Tor Lattimore Andreas Krause 38 46 0 25 Feb 2020
Explicit Explore-Exploit Algorithms in Continuous State Spaces Mikael Henaff OffRL 29 31 0 01 Nov 2019
Reinforcement Learning in Healthcare: A Survey Chao Yu Jiming Liu S. Nemati LM&MA OffRL 98 557 0 22 Aug 2019
Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes Ronan Fruit Matteo Pirotta A. Lazaric 18 61 0 06 Jul 2018
Curiosity-driven Exploration by Self-supervised Prediction Deepak Pathak Pulkit Agrawal Alexei A. Efros Trevor Darrell LRM SSL 91 2,416 0 15 May 2017
Deep Exploration via Randomized Value Functions Ian Osband Benjamin Van Roy Daniel Russo Zheng Wen 66 302 0 22 Mar 2017
Why is Posterior Sampling Better than Optimism for Reinforcement Learning? Ian Osband Benjamin Van Roy BDL 74 257 0 01 Jul 2016
Unifying Count-Based Exploration and Intrinsic Motivation Marc G. Bellemare S. Srinivasan Georg Ostrovski Tom Schaul D. Saxton Rémi Munos 156 1,465 0 06 Jun 2016
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 185 3,777 0 18 Nov 2015
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 168 13,174 0 09 Sep 2015