2403.06806
Cited By
On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
11 March 2024
Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Y. Levy, R. Srikant, Shie Mannor
Papers citing "On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes"
4 / 4 papers shown
Title
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor
Julien Grand-Clément, Marko Petrik
48
14
0
31 Jan 2023
On the Linear convergence of Natural Policy Gradient Algorithm
S. Khodadadian, P. Jhunjhunwala, Sushil Mahavir Varma, S. T. Maguluri
78
57
0
04 May 2021
Variational Policy Gradient Method for Reinforcement Learning with General Utilities
Junyu Zhang, Alec Koppel, Amrit Singh Bedi, Csaba Szepesvári, Mengdi Wang
59
139
0
04 Jul 2020
Global Optimality Guarantees For Policy Gradient Methods
Jalaj Bhandari, Daniel Russo
72
193
0
05 Jun 2019