On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces

28 January 2022

Papers citing "On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces"

10 / 10 papers shown

Title
Behind the Myth of Exploration in Policy Gradients Adrien Bolland Gaspard Lambrechts Damien Ernst 79 0 0 31 Jan 2024
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control Amrit Singh Bedi Anjaly Parayil Junyu Zhang Mengdi Wang Alec Koppel 63 15 0 15 Jun 2021
On the Linear convergence of Natural Policy Gradient Algorithm S. Khodadadian P. Jhunjhunwala Sushil Mahavir Varma S. T. Maguluri 78 57 0 04 May 2021
Hausdorff Dimension, Heavy Tails, and Generalization in Neural Networks Umut Simsekli Ozan Sener George Deligiannidis Murat A. Erdogdu 71 56 0 16 Jun 2020
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift Alekh Agarwal Sham Kakade Jason D. Lee G. Mahajan 61 320 0 01 Aug 2019
Global Optimality Guarantees For Policy Gradient Methods Jalaj Bhandari Daniel Russo 72 193 0 05 Jun 2019
Momentum-Based Variance Reduction in Non-Convex SGD Ashok Cutkosky Francesco Orabona ODL 78 406 0 24 May 2019
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 318 13,237 0 09 Sep 2015
Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming Saeed Ghadimi Guanghui Lan ODL 120 1,549 0 22 Sep 2013
Infinite-Horizon Policy-Gradient Estimation Jonathan Baxter Peter L. Bartlett 97 811 0 03 Jun 2011