Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings

30 October 2021

Papers citing "Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings"

Title
A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces | B. Kerimkulov, J. Leahy, David Siska, Lukasz Szpruch, Yufei Zhang | 29 7 0 | 04 Oct 2023
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies | Ilyas Fatkhullin, Anas Barakat, Anastasia Kireeva, Niao He | 32 37 0 | 03 Feb 2023
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization | Mudit Gaur, Vaneet Aggarwal, Mridul Agarwal | MLT | 41 1 0 | 14 Nov 2022
Geometry and convergence of natural policy gradient methods | Johannes Muller, Guido Montúfar | 29 11 0 | 03 Nov 2022
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method | Junyu Zhang, Chengzhuo Ni, Zheng Yu, Csaba Szepesvári, Mengdi Wang | 64 67 0 | 17 Feb 2021
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation | Harshat Kumar, Alec Koppel, Alejandro Ribeiro | 104 80 0 | 18 Oct 2019