Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.04556
Cited By
Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains
10 June 2019
Matthieu Zimmer
Paul Weng
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains"
1 / 1 papers shown
Title
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
26
0
0
10 Dec 2022
1