Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.05147
Cited By
Model-free Policy Learning with Reward Gradients
9 March 2021
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Model-free Policy Learning with Reward Gradients"
3 / 3 papers shown
Title
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
28
6
0
03 Feb 2023
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
31
19
0
23 Mar 2022
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
27
1
0
04 Feb 2022
1