2001.02435
A Nonparametric Off-Policy Policy Gradient
8 January 2020
Samuele Tosatto
João Carvalho
Hany Abdulsamad
Jan Peters
OffRL
Papers citing "A Nonparametric Off-Policy Policy Gradient"
4 / 4 papers shown
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
104
2
0
04 Feb 2022
Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills
Samuele Tosatto
Georgia Chalvatzaki
Jan Peters
69
12
0
26 Oct 2020
Statistically Efficient Off-Policy Policy Gradients
Nathan Kallus
Masatoshi Uehara
OffRL
110
39
0
10 Feb 2020
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions
Samuele Tosatto
R. Akrour
Jan Peters
64
4
0
29 Jan 2020