Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.13909
Cited By
Beyond Optimism: Exploration With Partially Observable Rewards
20 June 2024
Simone Parisi
Alireza Kazemipour
Michael Bowling
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beyond Optimism: Exploration With Partially Observable Rewards"
15 / 15 papers shown
Title
Optimistic Active Exploration of Dynamical Systems
Bhavya Sukhija
Lenart Treven
Cansu Sancaktar
Sebastian Blaes
Stelian Coros
Andreas Krause
72
18
0
21 Jun 2023
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning
Sam Lobel
Akhil Bagaria
George Konidaris
49
16
0
05 Jun 2023
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
71
35
0
19 Sep 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
45
334
0
02 May 2022
Active Learning for Nonlinear System Identification with Guarantees
Horia Mania
Michael I. Jordan
Benjamin Recht
63
102
0
18 Jun 2020
Information Directed Sampling for Linear Partial Monitoring
Johannes Kirschner
Tor Lattimore
Andreas Krause
38
46
0
25 Feb 2020
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
29
31
0
01 Nov 2019
Reinforcement Learning in Healthcare: A Survey
Chao Yu
Jiming Liu
S. Nemati
LM&MA
OffRL
98
557
0
22 Aug 2019
Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes
Ronan Fruit
Matteo Pirotta
A. Lazaric
18
61
0
06 Jul 2018
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
91
2,416
0
15 May 2017
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
66
302
0
22 Mar 2017
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Ian Osband
Benjamin Van Roy
BDL
74
257
0
01 Jul 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
156
1,465
0
06 Jun 2016
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
185
3,777
0
18 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
168
13,174
0
09 Sep 2015
1