ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.15141
  4. Cited By
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly
  Communicating MDPs
v1v2 (latest)

On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs

30 September 2022
Yi Wan
R. Sutton
ArXiv (abs)PDFHTML

Papers citing "On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs"

4 / 4 papers shown
Title
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
Milad Kazemi
Mateo Perez
Fabio Somenzi
Sadegh Soudjani
Ashutosh Trivedi
Alvaro Velasquez
89
1
0
21 May 2025
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
Zachary Roch
Chi Zhang
George Atia
Yue Wang
71
1
0
18 May 2025
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki
Isao Ono
70
2
0
04 Aug 2024
Model-Free Robust Average-Reward Reinforcement Learning
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
70
15
0
17 May 2023
1