Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments

25 May 2022

Papers citing "Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments"

Title
A Model Selection Approach for Corruption Robust Reinforcement Learning Chen-Yu Wei Christoph Dann Julian Zimmert 134 45 0 31 Dec 2024
Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints Yuhao Ding Javad Lavaei 46 11 0 28 Jan 2022
Finding the Stochastic Shortest Path with Low Regret: The Adversarial Cost and Unknown Transition Case Liyu Chen Haipeng Luo 64 31 0 10 Feb 2021
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon Zihan Zhang Xiangyang Ji S. Du OffRL 95 105 0 28 Sep 2020
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism Wang Chi Cheung D. Simchi-Levi Ruihao Zhu OffRL 58 96 0 24 Jun 2020
Near-optimal Regret Bounds for Stochastic Shortest Path Alon Cohen Haim Kaplan Yishay Mansour Aviv A. Rosenberg 58 55 0 23 Feb 2020
Optimistic Policy Optimization with Bandit Feedback Yonathan Efroni Lior Shani Aviv A. Rosenberg Shie Mannor 50 90 0 19 Feb 2020
Combinatorial Semi-Bandit in the Non-Stationary Environment Wei Chen Liwei Wang Haoyu Zhao Kai Zheng 69 18 0 10 Feb 2020
No-Regret Exploration in Goal-Oriented Reinforcement Learning Jean Tarbouriech Evrard Garcelon Michal Valko Matteo Pirotta A. Lazaric 63 46 0 07 Dec 2019
A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free Yifang Chen Chung-Wei Lee Haipeng Luo Chen-Yu Wei 125 133 0 03 Feb 2019