Global Optimality Guarantees For Policy Gradient Methods

5 June 2019

Papers citing "Global Optimality Guarantees For Policy Gradient Methods"

9 / 59 papers shown

Title
SAGA: A Fast Incremental Gradient Method With Support for Non-Strongly Convex Composite Objectives | Aaron Defazio, Francis R. Bach, Simon Lacoste-Julien | ODL | 131 1,823 0 | 01 Jul 2014
Approximate Policy Iteration Schemes: A Comparison | B. Scherrer | 52 93 0 | 12 May 2014
A Proximal Stochastic Gradient Method with Progressive Variance Reduction | Lin Xiao, Tong Zhang | ODL | 150 738 0 | 19 Mar 2014
The Information Geometry of Mirror Descent | Garvesh Raskutti, S. Mukherjee | 123 123 0 | 29 Oct 2013
Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming | Saeed Ghadimi, Guanghui Lan | ODL | 120 1,548 0 | 22 Sep 2013
(More) Efficient Reinforcement Learning via Posterior Sampling | Ian Osband, Daniel Russo, Benjamin Van Roy | 116 533 0 | 04 Jun 2013
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning | R. Ortner, D. Ryabko | OffRL | 81 85 0 | 11 Feb 2013
Metrics for Finite Markov Decision Processes | N. Ferns, Prakash Panangaden, Doina Precup | 74 320 0 | 11 Jul 2012
Infinite-Horizon Policy-Gradient Estimation | Jonathan Baxter, Peter L. Bartlett | 94 811 0 | 03 Jun 2011