ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.16318
  4. Cited By
Learning and Planning in Average-Reward Markov Decision Processes
v1v2v3 (latest)

Learning and Planning in Average-Reward Markov Decision Processes

29 June 2020
Yi Wan
A. Naik
R. Sutton
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Learning and Planning in Average-Reward Markov Decision Processes"

37 / 37 papers shown
Title
A Differential Perspective on Distributional Reinforcement Learning
A Differential Perspective on Distributional Reinforcement Learning
Juan Sebastian Rojas
Chi-Guhn Lee
OffRL
27
0
0
03 Jun 2025
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
Milad Kazemi
Mateo Perez
Fabio Somenzi
Sadegh Soudjani
Ashutosh Trivedi
Alvaro Velasquez
89
1
0
21 May 2025
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
Zachary Roch
Chi Zhang
George Atia
Yue Wang
71
1
0
18 May 2025
Planning and Learning in Average Risk-aware MDPs
Planning and Learning in Average Risk-aware MDPs
Weikai Wang
Erick Delage
87
0
0
22 Mar 2025
Reinforcement Teaching
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
198
2
0
28 Jan 2025
Average-Reward Reinforcement Learning with Entropy Regularization
Average-Reward Reinforcement Learning with Entropy Regularization
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
80
2
0
17 Jan 2025
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
Yi Wan
D. Korenkevych
Zheqing Zhu
OffRLCLL
79
0
0
12 Jan 2025
Ornstein-Uhlenbeck Adaptation as a Mechanism for Learning in Brains and
  Machines
Ornstein-Uhlenbeck Adaptation as a Mechanism for Learning in Brains and Machines
Jesus Garcia Fernandez
Nasir Ahmad
Marcel van Gerven
58
1
0
17 Oct 2024
Reinforcement Learning with LTL and $ω$-Regular Objectives via
  Optimality-Preserving Translation to Average Rewards
Reinforcement Learning with LTL and ωωω-Regular Objectives via Optimality-Preserving Translation to Average Rewards
Xuan-Bach Le
Dominik Wagner
Leon Witzman
Alexander Rabinovich
Luke Ong
87
4
0
16 Oct 2024
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and
  Risk-Awareness in Average-Reward Markov Decision Processes
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes
Juan Sebastian Rojas
Chi-Guhn Lee
72
1
0
14 Oct 2024
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki
Isao Ono
70
2
0
04 Aug 2024
Hierarchical Average-Reward Linearly-solvable Markov Decision Processes
Hierarchical Average-Reward Linearly-solvable Markov Decision Processes
Guillermo Infante
Anders Jonsson
Vicenç Gómez
50
0
0
09 Jul 2024
Reward Centering
Reward Centering
Abhishek Naik
Yi Wan
Manan Tomar
Richard S. Sutton
62
7
0
16 May 2024
Stochastic Halpern iteration in normed spaces and applications to reinforcement learning
Stochastic Halpern iteration in normed spaces and applications to reinforcement learning
Mario Bravo
Juan Pablo Contreras
97
4
0
19 Mar 2024
Why Online Reinforcement Learning is Causal
Why Online Reinforcement Learning is Causal
Oliver Schulte
Pascal Poupart
CMLOffRL
82
1
0
07 Mar 2024
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
87
14
0
03 Oct 2023
Learning to Stabilize Online Reinforcement Learning in Unbounded State
  Spaces
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
Brahma S. Pavse
M. Zurek
Yudong Chen
Qiaomin Xie
Josiah P. Hanna
OffRL
78
1
0
02 Jun 2023
Model-Free Robust Average-Reward Reinforcement Learning
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
70
15
0
17 May 2023
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
Tejas Pagare
Vivek Borkar
Konstantin Avrachenkov
70
5
0
07 Apr 2023
Performance Bounds for Policy-Based Average Reward Reinforcement
  Learning Algorithms
Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms
Yashaswini Murthy
Mehrdad Moharrami
R. Srikant
OffRL
61
5
0
02 Feb 2023
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
110
2
0
02 Feb 2023
Single-Trajectory Distributionally Robust Reinforcement Learning
Single-Trajectory Distributionally Robust Reinforcement Learning
Zhipeng Liang
Xiaoteng Ma
Jose H. Blanchet
Jiheng Zhang
Zhengyuan Zhou
OODOffRL
86
12
0
27 Jan 2023
Robust Average-Reward Markov Decision Processes
Robust Average-Reward Markov Decision Processes
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
102
14
0
02 Jan 2023
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly
  Communicating MDPs
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs
Yi Wan
R. Sutton
30
4
0
30 Sep 2022
Markovian Interference in Experiments
Markovian Interference in Experiments
Vivek F. Farias
Andrew A. Li
Tianyi Peng
Andrew Zheng
OffRL
88
33
0
06 Jun 2022
Stochastic first-order methods for average-reward Markov decision
  processes
Stochastic first-order methods for average-reward Markov decision processes
Tianjiao Li
Feiyang Wu
Guanghui Lan
115
14
0
11 May 2022
Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Dong-Ki Kim
Matthew D Riemer
Miao Liu
Jakob N. Foerster
Michael Everett
Chuangchuang Sun
Gerald Tesauro
Jonathan P. How
147
0
0
07 Mar 2022
Stochastic Gradient Descent with Dependent Data for Offline
  Reinforcement Learning
Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning
Jing-rong Dong
Xin T. Tong
OffRL
89
2
0
06 Feb 2022
Continual Learning In Environments With Polynomial Mixing Times
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
95
9
0
13 Dec 2021
Average-Reward Learning and Planning with Options
Average-Reward Learning and Planning with Options
Yi Wan
A. Naik
R. Sutton
28
7
0
26 Oct 2021
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Yiming Zhang
George Andriopoulos
OffRL
93
43
0
14 Jun 2021
Average-Reward Reinforcement Learning with Trust Region Methods
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
61
18
0
07 Jun 2021
Simple Agent, Complex Environment: Efficient Reinforcement Learning with
  Agent States
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States
Shi Dong
Benjamin Van Roy
Zhengyuan Zhou
107
32
0
10 Feb 2021
Breaking the Deadly Triad with a Target Network
Breaking the Deadly Triad with a Target Network
Shangtong Zhang
Hengshuai Yao
Shimon Whiteson
AAML
129
45
0
21 Jan 2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation
Average-Reward Off-Policy Policy Evaluation with Function Approximation
Shangtong Zhang
Yi Wan
R. Sutton
Shimon Whiteson
OffRL
73
31
0
08 Jan 2021
Average-reward model-free reinforcement learning: a systematic review
  and literature mapping
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
83
30
0
18 Oct 2020
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement
  Learning Tasks
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
Yuqian Jiang
Suda Bharadwaj
Bo Wu
Rishi Shah
Ufuk Topcu
Peter Stone
CLLOffRLLRM
58
43
0
03 Jul 2020
1