2006.06982
Cited By
Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales
12 June 2020
Masahiro Kato
OffRL
Papers citing "Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales"
1 / 1 papers shown
Title
More Efficient Off-Policy Evaluation through Regularized Targeted Learning
Aurélien F. Bibaut, Ivana Malenica, N. Vlassis, Mark van der Laan
OOD
OffRL
27
40
0
13 Dec 2019