2006.16318
Learning and Planning in Average-Reward Markov Decision Processes
29 June 2020
Yi Wan
A. Naik
R. Sutton
OffRL
Papers citing
"Learning and Planning in Average-Reward Markov Decision Processes"
37 / 37 papers shown
Title
A Differential Perspective on Distributional Reinforcement Learning
Juan Sebastian Rojas
Chi-Guhn Lee
OffRL
27
0
0
03 Jun 2025
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
Milad Kazemi
Mateo Perez
Fabio Somenzi
Sadegh Soudjani
Ashutosh Trivedi
Alvaro Velasquez
89
1
0
21 May 2025
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
Zachary Roch
Chi Zhang
George Atia
Yue Wang
71
1
0
18 May 2025
Planning and Learning in Average Risk-aware MDPs
Weikai Wang
Erick Delage
87
0
0
22 Mar 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
198
2
0
28 Jan 2025
Average-Reward Reinforcement Learning with Entropy Regularization
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
80
2
0
17 Jan 2025
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
Yi Wan
D. Korenkevych
Zheqing Zhu
OffRL
CLL
79
0
0
12 Jan 2025
Ornstein-Uhlenbeck Adaptation as a Mechanism for Learning in Brains and Machines
Jesus Garcia Fernandez
Nasir Ahmad
Marcel van Gerven
58
1
0
17 Oct 2024
Reinforcement Learning with LTL and ω-Regular Objectives via Optimality-Preserving Translation to Average Rewards
Xuan-Bach Le
Dominik Wagner
Leon Witzman
Alexander Rabinovich
Luke Ong
87
4
0
16 Oct 2024
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes
Juan Sebastian Rojas
Chi-Guhn Lee
72
1
0
14 Oct 2024
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki
Isao Ono
70
2
0
04 Aug 2024
Hierarchical Average-Reward Linearly-solvable Markov Decision Processes
Guillermo Infante
Anders Jonsson
Vicenç Gómez
50
0
0
09 Jul 2024
Reward Centering
Abhishek Naik
Yi Wan
Manan Tomar
Richard S. Sutton
62
7
0
16 May 2024
Stochastic Halpern iteration in normed spaces and applications to reinforcement learning
Mario Bravo
Juan Pablo Contreras
97
4
0
19 Mar 2024
Why Online Reinforcement Learning is Causal
Oliver Schulte
Pascal Poupart
CML
OffRL
82
1
0
07 Mar 2024
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
87
14
0
03 Oct 2023
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
Brahma S. Pavse
M. Zurek
Yudong Chen
Qiaomin Xie
Josiah P. Hanna
OffRL
78
1
0
02 Jun 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
70
15
0
17 May 2023
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
Tejas Pagare
Vivek Borkar
Konstantin Avrachenkov
70
5
0
07 Apr 2023
Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms
Yashaswini Murthy
Mehrdad Moharrami
R. Srikant
OffRL
61
5
0
02 Feb 2023
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
110
2
0
02 Feb 2023
Single-Trajectory Distributionally Robust Reinforcement Learning
Zhipeng Liang
Xiaoteng Ma
Jose H. Blanchet
Jiheng Zhang
Zhengyuan Zhou
OOD
OffRL
86
12
0
27 Jan 2023
Robust Average-Reward Markov Decision Processes
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
102
14
0
02 Jan 2023
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs
Yi Wan
R. Sutton
30
4
0
30 Sep 2022
Markovian Interference in Experiments
Vivek F. Farias
Andrew A. Li
Tianyi Peng
Andrew Zheng
OffRL
88
33
0
06 Jun 2022
Stochastic first-order methods for average-reward Markov decision processes
Tianjiao Li
Feiyang Wu
Guanghui Lan
115
14
0
11 May 2022
Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Dong-Ki Kim
Matthew D Riemer
Miao Liu
Jakob N. Foerster
Michael Everett
Chuangchuang Sun
Gerald Tesauro
Jonathan P. How
147
0
0
07 Mar 2022
Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning
Jing-rong Dong
Xin T. Tong
OffRL
89
2
0
06 Feb 2022
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
95
9
0
13 Dec 2021
Average-Reward Learning and Planning with Options
Yi Wan
A. Naik
R. Sutton
28
7
0
26 Oct 2021
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Yiming Zhang
George Andriopoulos
OffRL
93
43
0
14 Jun 2021
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
61
18
0
07 Jun 2021
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States
Shi Dong
Benjamin Van Roy
Zhengyuan Zhou
107
32
0
10 Feb 2021
Breaking the Deadly Triad with a Target Network
Shangtong Zhang
Hengshuai Yao
Shimon Whiteson
AAML
129
45
0
21 Jan 2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation
Shangtong Zhang
Yi Wan
R. Sutton
Shimon Whiteson
OffRL
73
31
0
08 Jan 2021
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
83
30
0
18 Oct 2020
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
Yuqian Jiang
Suda Bharadwaj
Bo Wu
Rishi Shah
Ufuk Topcu
Peter Stone
CLL
OffRL
LRM
58
43
0
03 Jul 2020