2305.08359
Cited By
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
15 May 2023
Kaixuan Ji
Qingyue Zhao
Jiafan He
Weitong Zhang
Q. Gu
Papers citing
"Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs"
6 / 6 papers shown
Nearly Optimal Sample Complexity of Offline KL-Regularized Contextual Bandits under Single-Policy Concentrability
Qingyue Zhao
Kaixuan Ji
Heyang Zhao
Tong Zhang
Q. Gu
OffRL
45
0
0
09 Feb 2025
Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback
Yan Dai
Haipeng Luo
Liyu Chen
60
19
0
26 May 2022
Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs
Dongruo Zhou
Quanquan Gu
75
43
0
23 May 2022
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
Jiafan He
Dongruo Zhou
Quanquan Gu
95
23
0
17 Feb 2021
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
Zihan Zhang
Jiaqi Yang
Xiangyang Ji
S. Du
71
36
0
29 Jan 2021
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
135
135
0
09 Dec 2019