Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.12332
Cited By
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
28 January 2022
Amrit Singh Bedi
Souradip Chakraborty
Anjaly Parayil
Brian M Sadler
Pratap Tokekar
Alec Koppel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces"
10 / 10 papers shown
Title
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
79
0
0
31 Jan 2024
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
Amrit Singh Bedi
Anjaly Parayil
Junyu Zhang
Mengdi Wang
Alec Koppel
63
15
0
15 Jun 2021
On the Linear convergence of Natural Policy Gradient Algorithm
S. Khodadadian
P. Jhunjhunwala
Sushil Mahavir Varma
S. T. Maguluri
78
57
0
04 May 2021
Hausdorff Dimension, Heavy Tails, and Generalization in Neural Networks
Umut Simsekli
Ozan Sener
George Deligiannidis
Murat A. Erdogdu
71
56
0
16 Jun 2020
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
Alekh Agarwal
Sham Kakade
Jason D. Lee
G. Mahajan
61
320
0
01 Aug 2019
Global Optimality Guarantees For Policy Gradient Methods
Jalaj Bhandari
Daniel Russo
72
193
0
05 Jun 2019
Momentum-Based Variance Reduction in Non-Convex SGD
Ashok Cutkosky
Francesco Orabona
ODL
78
406
0
24 May 2019
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
318
13,237
0
09 Sep 2015
Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming
Saeed Ghadimi
Guanghui Lan
ODL
120
1,549
0
22 Sep 2013
Infinite-Horizon Policy-Gradient Estimation
Jonathan Baxter
Peter L. Bartlett
97
811
0
03 Jun 2011
1