ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.13703
  4. Cited By
Evaluating the Robustness of Off-Policy Evaluation

Evaluating the Robustness of Off-Policy Evaluation

31 August 2021
Yuta Saito
Takuma Udagawa
Haruka Kiyohara
Kazuki Mogi
Yusuke Narita
Kei Tateno
    ELM
    OffRL
ArXivPDFHTML

Papers citing "Evaluating the Robustness of Off-Policy Evaluation"

19 / 19 papers shown
Title
Optimal Off-Policy Evaluation from Multiple Logging Policies
Optimal Off-Policy Evaluation from Multiple Logging Policies
Nathan Kallus
Yuta Saito
Masatoshi Uehara
OffRL
53
40
0
21 Oct 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible
  Off-Policy Evaluation
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
161
75
0
17 Aug 2020
Evaluating the Performance of Reinforcement Learning Algorithms
Evaluating the Performance of Reinforcement Learning Algorithms
Scott M. Jordan
Yash Chandak
Daniel Cohen
Mengxue Zhang
Philip S. Thomas
42
46
0
30 Jun 2020
Off-Policy Evaluation and Learning for External Validity under a
  Covariate Shift
Off-Policy Evaluation and Learning for External Validity under a Covariate Shift
Masahiro Kato
Masatoshi Uehara
Shota Yasui
OffRL
61
53
0
26 Feb 2020
Adaptive Estimator Selection for Off-Policy Evaluation
Adaptive Estimator Selection for Off-Policy Evaluation
Yi-Hsun Su
Pavithra Srinath
A. Krishnamurthy
OffRL
50
47
0
18 Feb 2020
Empirical Study of Off-Policy Policy Evaluation for Reinforcement
  Learning
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
52
154
0
15 Nov 2019
Triply Robust Off-Policy Evaluation
Triply Robust Off-Policy Evaluation
Anqi Liu
Hao Liu
Anima Anandkumar
Yisong Yue
OffRL
57
10
0
13 Nov 2019
Doubly robust off-policy evaluation with shrinkage
Doubly robust off-policy evaluation with shrinkage
Yi-Hsun Su
Maria Dimakopoulou
A. Krishnamurthy
Miroslav Dudík
OffRL
56
106
0
22 Jul 2019
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for
  Reinforcement Learning
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
70
54
0
09 Jun 2019
Efficient Counterfactual Learning from Bandit Feedback
Efficient Counterfactual Learning from Bandit Feedback
Yusuke Narita
Shota Yasui
Kohei Yata
OffRL
63
48
0
10 Sep 2018
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration
  Matters
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters
Aniruddh Raghu
Omer Gottesman
Yao Liu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
54
34
0
03 Jul 2018
More Robust Doubly Robust Off-policy Evaluation
More Robust Doubly Robust Off-policy Evaluation
Mehrdad Farajtabar
Yinlam Chow
Mohammad Ghavamzadeh
OffRL
70
267
0
10 Feb 2018
Offline A/B testing for Recommender Systems
Offline A/B testing for Recommender Systems
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
69
221
0
22 Jan 2018
Effective Evaluation using Logged Bandit Feedback from Multiple Loggers
Effective Evaluation using Logged Bandit Feedback from Multiple Loggers
Aman Agarwal
Soumya Basu
Tobias Schnabel
Thorsten Joachims
OffRL
108
68
0
17 Mar 2017
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Yu Wang
Alekh Agarwal
Miroslav Dudík
OffRL
107
221
0
04 Dec 2016
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
200
623
0
11 Nov 2015
Doubly Robust Policy Evaluation and Optimization
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
180
285
0
10 Mar 2015
Learning from Logged Implicit Exploration Data
Learning from Logged Implicit Exploration Data
Alexander L. Strehl
John Langford
Sham Kakade
Lihong Li
OffRL
181
255
0
27 Feb 2010
The Offset Tree for Learning with Partial Labels
The Offset Tree for Learning with Partial Labels
A. Beygelzimer
John Langford
302
185
0
21 Dec 2008
1