Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1301.2315
Cited By
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
10 January 2013
Lex Weaver
Nigel Tao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Optimal Reward Baseline for Gradient-Based Reinforcement Learning"
2 / 2 papers shown
Title
Multi-Fidelity Policy Gradient Algorithms
Xinjie Liu
Cyrus Neary
Kushagra Gupta
Christian Ellis
Ufuk Topcu
David Fridovich-Keil
OffRL
381
0
0
07 Mar 2025
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
102
5
0
13 Dec 2023
1