1804.10488
Offline Evaluation of Ranking Policies with Click Models
27 April 2018
Shuai Li
Yasin Abbasi-Yadkori
Branislav Kveton
S. Muthukrishnan
Vishwa Vinay
Zheng Wen
CML
OffRL
Papers citing
"Offline Evaluation of Ranking Policies with Click Models"
12 / 12 papers shown
Title
Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation
Xu He
Bo An
Yanghua Li
Haikai Chen
Qingyu Guo
Xuzhao Li
Zhirong Wang
64
12
0
21 Aug 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
108
75
0
17 Aug 2020
Offline A/B testing for Recommender Systems
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
62
220
0
22 Jan 2018
Unbiased Learning-to-Rank with Biased Feedback
Thorsten Joachims
Adith Swaminathan
Tobias Schnabel
CML
70
538
0
16 Aug 2016
Off-policy evaluation for slate recommendation
Adith Swaminathan
A. Krishnamurthy
Alekh Agarwal
Miroslav Dudík
John Langford
Damien Jose
I. Zitouni
CML
OffRL
43
227
0
16 May 2016
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits
Branislav Kveton
Zheng Wen
Azin Ashkan
Csaba Szepesvári
29
47
0
03 Oct 2014
Efficient Learning in Large-Scale Combinatorial Semi-Bandits
Zheng Wen
Branislav Kveton
Azin Ashkan
OffRL
100
96
0
28 Jun 2014
Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques
Olivier Nicol
Jérémie Mary
Philippe Preux
OffRL
37
23
0
14 May 2014
Counterfactual Reasoning and Learning Systems
Léon Bottou
J. Peters
J. Q. Candela
Denis Xavier Charles
D. M. Chickering
Elon Portugaly
Dipankar Ray
Patrice Y. Simard
Edward Snelson
CML
OffRL
178
781
0
11 Sep 2012
Doubly Robust Policy Evaluation and Learning
Miroslav Dudík
John Langford
Lihong Li
OffRL
157
694
0
23 Mar 2011
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms
Lihong Li
Wei Chu
John Langford
Xuanhui Wang
OffRL
150
574
0
31 Mar 2010
Learning from Logged Implicit Exploration Data
Alexander L. Strehl
John Langford
Sham Kakade
Lihong Li
OffRL
112
254
0
27 Feb 2010