Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.07781
Cited By
Minimax Off-Policy Evaluation for Multi-Armed Bandits
19 January 2021
Cong Ma
Banghua Zhu
Jiantao Jiao
Martin J. Wainwright
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Minimax Off-Policy Evaluation for Multi-Armed Bandits"
4 / 4 papers shown
Title
High-probability sample complexities for policy evaluation with linear function approximation
Gen Li
Weichen Wu
Yuejie Chi
Cong Ma
Alessandro Rinaldo
Yuting Wei
OffRL
40
7
0
30 May 2023
Kernel-based off-policy estimation without overlap: Instance optimality beyond semiparametric efficiency
Wenlong Mou
Peng Ding
Martin J. Wainwright
Peter L. Bartlett
OffRL
40
10
0
16 Jan 2023
Off-policy estimation of linear functionals: Non-asymptotic theory for semi-parametric efficiency
Wenlong Mou
Martin J. Wainwright
Peter L. Bartlett
OffRL
43
11
0
26 Sep 2022
Bandit Algorithms for Precision Medicine
Yangyi Lu
Ziping Xu
Ambuj Tewari
66
11
0
10 Aug 2021
1