ResearchTrend.AI
Stochastic first-order methods for average-reward Markov decision processes

11 May 2022
Tianjiao Li
Feiyang Wu
Guanghui Lan
arXiv: 2205.05800

Papers citing "Stochastic first-order methods for average-reward Markov decision processes"

11 / 11 papers shown
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
Zachary Roch
Chi Zhang
George Atia
Yue Wang
75
1
0
18 May 2025
Towards Optimal Offline Reinforcement Learning
Mengmeng Li
Daniel Kuhn
Tobias Sutter
OffRL
147
0
0
15 Mar 2025
Finding good policies in average-reward Markov Decision Processes without prior knowledge
Adrienne Tuynman
Rémy Degenne
Emilie Kaufmann
100
4
0
27 May 2024
Provable Policy Gradient Methods for Average-Reward Markov Potential Games
Min Cheng
Ruida Zhou
P. R. Kumar
Chao Tian
98
6
0
09 Mar 2024
Infer and Adapt: Bipedal Locomotion Reward Learning from Demonstrations via Inverse Reinforcement Learning
Chao Liu
Zhaoyuan Gu
Hanran Wu
Deniz Irem Erus
Ye Zhao
110
6
0
28 Sep 2023
Accelerated stochastic approximation with state-dependent noise
Sasila Ilandarideva
A. Juditsky
Guanghui Lan
Tianjiao Li
80
8
0
04 Jul 2023
Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes
Zihan Zhang
Qiaomin Xie
OffRL
64
20
0
28 Jun 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
OffRL
69
5
0
15 Jun 2023
Inverse Reinforcement Learning with the Average Reward Criterion
Feiyang Wu
Jingyang Ke
Anqi Wu
85
11
0
24 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
72
15
0
17 May 2023
Policy Mirror Descent Inherently Explores Action Space
Yan Li
Guanghui Lan
OffRL
127
8
0
08 Mar 2023