Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.13703
Cited By
Evaluating the Robustness of Off-Policy Evaluation
31 August 2021
Yuta Saito
Takuma Udagawa
Haruka Kiyohara
Kazuki Mogi
Yusuke Narita
Kei Tateno
ELM
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating the Robustness of Off-Policy Evaluation"
19 / 19 papers shown
Title
Optimal Off-Policy Evaluation from Multiple Logging Policies
Nathan Kallus
Yuta Saito
Masatoshi Uehara
OffRL
53
40
0
21 Oct 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
161
75
0
17 Aug 2020
Evaluating the Performance of Reinforcement Learning Algorithms
Scott M. Jordan
Yash Chandak
Daniel Cohen
Mengxue Zhang
Philip S. Thomas
42
46
0
30 Jun 2020
Off-Policy Evaluation and Learning for External Validity under a Covariate Shift
Masahiro Kato
Masatoshi Uehara
Shota Yasui
OffRL
61
53
0
26 Feb 2020
Adaptive Estimator Selection for Off-Policy Evaluation
Yi-Hsun Su
Pavithra Srinath
A. Krishnamurthy
OffRL
50
47
0
18 Feb 2020
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
52
154
0
15 Nov 2019
Triply Robust Off-Policy Evaluation
Anqi Liu
Hao Liu
Anima Anandkumar
Yisong Yue
OffRL
57
10
0
13 Nov 2019
Doubly robust off-policy evaluation with shrinkage
Yi-Hsun Su
Maria Dimakopoulou
A. Krishnamurthy
Miroslav Dudík
OffRL
56
106
0
22 Jul 2019
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
70
54
0
09 Jun 2019
Efficient Counterfactual Learning from Bandit Feedback
Yusuke Narita
Shota Yasui
Kohei Yata
OffRL
63
48
0
10 Sep 2018
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters
Aniruddh Raghu
Omer Gottesman
Yao Liu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
54
34
0
03 Jul 2018
More Robust Doubly Robust Off-policy Evaluation
Mehrdad Farajtabar
Yinlam Chow
Mohammad Ghavamzadeh
OffRL
70
267
0
10 Feb 2018
Offline A/B testing for Recommender Systems
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
69
221
0
22 Jan 2018
Effective Evaluation using Logged Bandit Feedback from Multiple Loggers
Aman Agarwal
Soumya Basu
Tobias Schnabel
Thorsten Joachims
OffRL
108
68
0
17 Mar 2017
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Yu Wang
Alekh Agarwal
Miroslav Dudík
OffRL
107
221
0
04 Dec 2016
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
200
623
0
11 Nov 2015
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
180
285
0
10 Mar 2015
Learning from Logged Implicit Exploration Data
Alexander L. Strehl
John Langford
Sham Kakade
Lihong Li
OffRL
181
255
0
27 Feb 2010
The Offset Tree for Learning with Partial Labels
A. Beygelzimer
John Langford
302
185
0
21 Dec 2008
1