Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time

24 May 2023

Papers citing "Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time"

Title
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences | Yangshijie Zhang | AAML | 37 0 0 | 24 Feb 2025
Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms | Navdeep Kumar, Priyank Agrawal, Giorgia Ramponi, Kfir Y. Levy, Shie Mannor | - | 33 0 0 | 11 Oct 2024
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs | Kihyuk Hong, Yufan Zhang, Dabeen Lee, Ambuj Tewari | - | 32 2 0 | 23 May 2024
Enhancing Classification Performance via Reinforcement Learning for Feature Selection | Younes Ghazagh Jahed, Seyyed Ali Sadat Tavana | - | 24 2 0 | 09 Mar 2024
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints | Dan Qiao, Yu-Xiang Wang | OffRL | 22 3 0 | 02 Feb 2024
Settling the Sample Complexity of Online Reinforcement Learning | Zihan Zhang, Yuxin Chen, Jason D. Lee, S. Du | OffRL | 90 21 0 | 25 Jul 2023
UCB Momentum Q-learning: Correcting the bias without forgetting | Pierre Menard, O. D. Domingues, Xuedong Shang, Michal Valko | - | 77 40 0 | 01 Mar 2021
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method | Junyu Zhang, Chengzhuo Ni, Zheng Yu, Csaba Szepesvári, Mengdi Wang | - | 44 67 0 | 17 Feb 2021
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints | Chi Jin, Zhuoran Yang, Zhaoran Wang | OffRL | 107 166 0 | 06 Jan 2021
A Proximal Stochastic Gradient Method with Progressive Variance Reduction | Lin Xiao, Tong Zhang | ODL | 78 736 0 | 19 Mar 2014