Average-reward model-free reinforcement learning: a systematic review and literature mapping

18 October 2020
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
arXiv: 2010.08920

Papers citing "Average-reward model-free reinforcement learning: a systematic review and literature mapping"

11 / 11 papers shown
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
Milad Kazemi
Mateo Perez
Fabio Somenzi
Sadegh Soudjani
Ashutosh Trivedi
Alvaro Velasquez
43
0
0
21 May 2025
A Finite-Sample Analysis of Distributionally Robust Average-Reward Reinforcement Learning
Zachary Roch
Chi Zhang
George Atia
Yue Wang
32
0
0
18 May 2025
Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks
Jean Seong Bjorn Choe
Bumkyu Choi
Jong-kook Kim
54
2
0
13 Sep 2024
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
41
0
0
03 Jun 2024
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
Tejas Pagare
Vivek Borkar
Konstantin Avrachenkov
51
4
0
07 Apr 2023
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor
Julien Grand-Clément
Marko Petrik
44
14
0
31 Jan 2023
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
29
16
0
07 Jun 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
390
1,980
0
04 May 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
104
147
0
04 May 2020
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
109
104
0
15 Oct 2019
A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average Reward
Susan Murphy
Yanzhen Deng
Eric B. Laber
H. Maei
R. Sutton
K. Witkiewitz
OffRL
38
22
0
18 Jul 2016