ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.13909
  4. Cited By
Beyond Optimism: Exploration With Partially Observable Rewards

Beyond Optimism: Exploration With Partially Observable Rewards

20 June 2024
Simone Parisi
Alireza Kazemipour
Michael Bowling
    OffRL
ArXivPDFHTML

Papers citing "Beyond Optimism: Exploration With Partially Observable Rewards"

15 / 15 papers shown
Title
Optimistic Active Exploration of Dynamical Systems
Optimistic Active Exploration of Dynamical Systems
Bhavya Sukhija
Lenart Treven
Cansu Sancaktar
Sebastian Blaes
Stelian Coros
Andreas Krause
72
18
0
21 Jun 2023
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement
  Learning
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning
Sam Lobel
Akhil Bagaria
George Konidaris
49
16
0
05 Jun 2023
An information-theoretic perspective on intrinsic motivation in
  reinforcement learning: a survey
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
71
35
0
19 Sep 2022
Exploration in Deep Reinforcement Learning: A Survey
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
45
334
0
02 May 2022
Active Learning for Nonlinear System Identification with Guarantees
Active Learning for Nonlinear System Identification with Guarantees
Horia Mania
Michael I. Jordan
Benjamin Recht
63
102
0
18 Jun 2020
Information Directed Sampling for Linear Partial Monitoring
Information Directed Sampling for Linear Partial Monitoring
Johannes Kirschner
Tor Lattimore
Andreas Krause
38
46
0
25 Feb 2020
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
29
31
0
01 Nov 2019
Reinforcement Learning in Healthcare: A Survey
Reinforcement Learning in Healthcare: A Survey
Chao Yu
Jiming Liu
S. Nemati
LM&MA
OffRL
98
557
0
22 Aug 2019
Near Optimal Exploration-Exploitation in Non-Communicating Markov
  Decision Processes
Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes
Ronan Fruit
Matteo Pirotta
A. Lazaric
18
61
0
06 Jul 2018
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
91
2,416
0
15 May 2017
Deep Exploration via Randomized Value Functions
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
66
302
0
22 Mar 2017
Why is Posterior Sampling Better than Optimism for Reinforcement
  Learning?
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Ian Osband
Benjamin Van Roy
BDL
74
257
0
01 Jul 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
156
1,465
0
06 Jun 2016
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
185
3,777
0
18 Nov 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
168
13,174
0
09 Sep 2015
1