Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains

10 June 2019

Papers citing "Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains"


Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su, Yen-Ju Chen, Ping-Chun Hsieh, Xi Liu
OffRL | 26 | 0 | 0 | 10 Dec 2022