Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.01721
Cited By
Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits
3 February 2022
Aaron David Tucker
Thorsten Joachims
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits"
3 / 3 papers shown
Title
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
58
5
0
29 Jan 2023
Safe Exploration for Efficient Policy Evaluation and Comparison
Runzhe Wan
Branislav Kveton
Rui Song
OffRL
38
10
0
26 Feb 2022
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning
Rujie Zhong
Duohan Zhang
Lukas Schafer
Stefano V. Albrecht
Josiah P. Hanna
OOD
OffRL
15
12
0
29 Nov 2021
1