arXiv:1706.06643
Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines
20 June 2017
Philip S. Thomas
Emma Brunskill
OffRL
Papers citing "Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines"
2 / 2 papers shown
Title
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
177
3
0
03 Oct 2024
A Notation for Markov Decision Processes
Philip S. Thomas
Billy Okal
147
17
0
30 Dec 2015