2406.01234
Cited By
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
3 June 2024
Victor Boone
Zihan Zhang
Papers citing "Achieving Tractable Minimax Optimal Regret in Average Reward MDPs"
6 / 6 papers shown
Optimistic Q-learning for average reward and episodic reinforcement learning
Priyank Agrawal
Shipra Agrawal
112
6
0
18 Jul 2024
Improved Analysis of UCRL2 with Empirical Bernstein Inequality
Ronan Fruit
Matteo Pirotta
A. Lazaric
42
33
0
10 Jul 2020
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Anders Jonsson
E. Kaufmann
Pierre Ménard
O. D. Domingues
Edouard Leurent
Michal Valko
57
35
0
10 Jun 2020
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
154
108
0
15 Oct 2019
Regret Minimization for Reinforcement Learning by Evaluating the Optimal Bias Function
Zihan Zhang
Xiangyang Ji
69
72
0
12 Jun 2019
Posterior Sampling for Large Scale Reinforcement Learning
Georgios Theocharous
Zheng Wen
Yasin Abbasi-Yadkori
N. Vlassis
BDL
OffRL
57
25
0
21 Nov 2017