Model-free Policy Learning with Reward Gradients

arXiv: 2103.05147

9 March 2021
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood

Papers citing "Model-free Policy Learning with Reward Gradients"

3 / 3 papers shown
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
28
6
0
03 Feb 2023
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
31
19
0
23 Mar 2022
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
27
1
0
04 Feb 2022