ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.04843
  4. Cited By
Policy Gradient using Weak Derivatives for Reinforcement Learning

Policy Gradient using Weak Derivatives for Reinforcement Learning

9 April 2020
Sujay Bhatt
Alec Koppel
Vikram Krishnamurthy
ArXivPDFHTML

Papers citing "Policy Gradient using Weak Derivatives for Reinforcement Learning"

3 / 3 papers shown
Title
Behind the Myth of Exploration in Policy Gradients
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
56
0
0
31 Jan 2024
Policy Gradient Algorithms Implicitly Optimize by Continuation
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
39
3
0
11 May 2023
On the Sample Complexity and Metastability of Heavy-tailed Policy Search
  in Continuous Control
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
Amrit Singh Bedi
Anjaly Parayil
Junyu Zhang
Mengdi Wang
Alec Koppel
33
15
0
15 Jun 2021
1