Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.06738
Cited By
Reducing Sampling Error in Batch Temporal Difference Learning
15 August 2020
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reducing Sampling Error in Batch Temporal Difference Learning"
4 / 4 papers shown
Title
Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
Guoxi Zhang
H. Kashima
OffRL
29
2
0
29 Nov 2022
User-Interactive Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
25
11
0
21 May 2022
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
Subhojyoti Mukherjee
Josiah P. Hanna
Robert D. Nowak
OffRL
29
12
0
09 Mar 2022
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
32
52
0
26 Apr 2021
1