Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.06982
Cited By
Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales
12 June 2020
Masahiro Kato
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales"
2 / 2 papers shown
Title
More Efficient Off-Policy Evaluation through Regularized Targeted Learning
Aurélien F. Bibaut
Ivana Malenica
N. Vlassis
Mark van der Laan
OOD
OffRL
27
40
0
13 Dec 2019
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
Michael Oberst
David Sontag
CML
OffRL
43
170
0
14 May 2019
1