Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.03493
Cited By
More Robust Doubly Robust Off-policy Evaluation
10 February 2018
Mehrdad Farajtabar
Yinlam Chow
Mohammad Ghavamzadeh
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"More Robust Doubly Robust Off-policy Evaluation"
18 / 18 papers shown
Title
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
164
0
0
02 May 2025
Counterfactual Inference under Thompson Sampling
Olivier Jeunen
OffRL
LRM
53
0
0
03 Apr 2025
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
117
5
0
22 Feb 2024
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
137
75
0
17 Aug 2020
Off-policy Bandits with Deficient Support
Noveen Sachdeva
Yi-Hsun Su
Thorsten Joachims
OffRL
150
75
0
16 Jun 2020
Large-scale Causal Approaches to Debiasing Post-click Conversion Rate Estimation with Multi-task Learning
Wenhao Zhang
Wentian Bao
Xiao-Yang Liu
Keping Yang
Quan Lin
Hong Wen
Ramin Ramezani
CML
72
106
0
16 Oct 2019
Causal Effect Inference with Deep Latent-Variable Models
Christos Louizos
Uri Shalit
Joris Mooij
David Sontag
R. Zemel
Max Welling
CML
BDL
171
742
0
24 May 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A. Gruslys
Will Dabney
M. G. Azar
Bilal Piot
Marc G. Bellemare
Rémi Munos
43
58
0
15 Apr 2017
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation
Josiah P. Hanna
Peter Stone
S. Niekum
OffRL
18
4
0
20 Jun 2016
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
138
615
0
08 Jun 2016
Off-policy evaluation for slate recommendation
Adith Swaminathan
A. Krishnamurthy
Alekh Agarwal
Miroslav Dudík
John Langford
Damien Jose
I. Zitouni
CML
OffRL
53
227
0
16 May 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
324
576
0
04 Apr 2016
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
176
623
0
11 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
154
7,623
0
22 Sep 2015
Off-policy Learning with Eligibility Traces: A Survey
Matthieu Geist
B. Scherrer
OffRL
83
94
0
15 Apr 2013
Counterfactual Reasoning and Learning Systems
Léon Bottou
J. Peters
J. Q. Candela
Denis Xavier Charles
D. M. Chickering
Elon Portugaly
Dipankar Ray
Patrice Y. Simard
Edward Snelson
CML
OffRL
280
783
0
11 Sep 2012
Doubly Robust Policy Evaluation and Learning
Miroslav Dudík
John Langford
Lihong Li
OffRL
243
697
0
23 Mar 2011
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms
Lihong Li
Wei Chu
John Langford
Xuanhui Wang
OffRL
187
575
0
31 Mar 2010
1