2008.06738
Reducing Sampling Error in Batch Temporal Difference Learning
15 August 2020
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
Papers citing "Reducing Sampling Error in Batch Temporal Difference Learning"
6 / 6 papers shown
Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
Guoxi Zhang
H. Kashima
OffRL
29
2
0
29 Nov 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
40
3
0
15 Sep 2022
User-Interactive Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
27
11
0
21 May 2022
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
Subhojyoti Mukherjee
Josiah P. Hanna
Robert D. Nowak
OffRL
29
12
0
09 Mar 2022
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning
Rujie Zhong
Duohan Zhang
Lukas Schafer
Stefano V. Albrecht
Josiah P. Hanna
OOD
OffRL
15
12
0
29 Nov 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
34
52
0
26 Apr 2021