Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.02864
Cited By
v1
v2 (latest)
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
7 September 2022
Zixuan Dong
Che Wang
George Andriopoulos
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs"
10 / 10 papers shown
Title
Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning
Suei-Wen Chen
Keith Ross
Pierre Youssef
60
1
0
03 Oct 2024
Online Learning of Decision Trees with Thompson Sampling
Ayman Chaouki
Jesse Read
Albert Bifet
72
2
0
09 Apr 2024
Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs
Ian A. Kash
L. Reyzin
Zishun Yu
105
0
0
18 May 2022
On the Convergence of Reinforcement Learning with Monte Carlo Exploring Starts
Jun Liu
33
15
0
21 Jul 2020
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning
Che Wang
Shuhan Yuan
Kai Shao
George Andriopoulos
40
12
0
10 Feb 2020
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
113
561
0
11 Jul 2019
On the convergence of optimistic policy iteration for stochastic shortest path problem
Yuanlong Chen
39
7
0
27 Aug 2018
Minimax Regret Bounds for Reinforcement Learning
M. G. Azar
Ian Osband
Rémi Munos
101
778
0
16 Mar 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
195
1,487
0
06 Jun 2016
Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret
Wesley Cowan
M. Katehakis
145
14
0
12 May 2015
1