ResearchTrend.AI
Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments

25 May 2022
Liyu Chen
Haipeng Luo

Papers citing "Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments"

10 / 10 papers shown
A Model Selection Approach for Corruption Robust Reinforcement Learning
Chen-Yu Wei
Christoph Dann
Julian Zimmert
134
45
0
31 Dec 2024
Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints
Yuhao Ding
Javad Lavaei
46
11
0
28 Jan 2022
Finding the Stochastic Shortest Path with Low Regret: The Adversarial Cost and Unknown Transition Case
Liyu Chen
Haipeng Luo
64
31
0
10 Feb 2021
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon
Zihan Zhang
Xiangyang Ji
S. Du
OffRL
95
105
0
28 Sep 2020
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
OffRL
58
96
0
24 Jun 2020
Near-optimal Regret Bounds for Stochastic Shortest Path
Alon Cohen
Haim Kaplan
Yishay Mansour
Aviv A. Rosenberg
58
55
0
23 Feb 2020
Optimistic Policy Optimization with Bandit Feedback
Yonathan Efroni
Lior Shani
Aviv A. Rosenberg
Shie Mannor
50
90
0
19 Feb 2020
Combinatorial Semi-Bandit in the Non-Stationary Environment
Wei Chen
Liwei Wang
Haoyu Zhao
Kai Zheng
69
18
0
10 Feb 2020
No-Regret Exploration in Goal-Oriented Reinforcement Learning
Jean Tarbouriech
Evrard Garcelon
Michal Valko
Matteo Pirotta
A. Lazaric
63
46
0
07 Dec 2019
A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free
Yifang Chen
Chung-Wei Lee
Haipeng Luo
Chen-Yu Wei
125
133
0
03 Feb 2019