ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.04556
  4. Cited By
Exploiting the Sign of the Advantage Function to Learn Deterministic
  Policies in Continuous Domains

Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains

10 June 2019
Matthieu Zimmer
Paul Weng
ArXivPDFHTML

Papers citing "Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains"

1 / 1 papers shown
Title
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
26
0
0
10 Dec 2022
1