Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.02593
Cited By
Pessimistic Off-Policy Optimization for Learning to Rank
6 June 2022
Matej Cief
Branislav Kveton
Michal Kompan
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pessimistic Off-Policy Optimization for Learning to Rank"
10 / 10 papers shown
Title
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRL
LRM
145
276
0
13 Jun 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
212
289
0
22 Mar 2021
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
140
56
0
25 Jul 2020
Offline Evaluation of Ranking Policies with Click Models
Shuai Li
Yasin Abbasi-Yadkori
Branislav Kveton
S. Muthukrishnan
Vishwa Vinay
Zheng Wen
CML
OffRL
54
67
0
27 Apr 2018
Offline A/B testing for Recommender Systems
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
69
221
0
22 Jan 2018
How Algorithmic Confounding in Recommendation Systems Increases Homogeneity and Decreases Utility
A. Chaney
Brandon M Stewart
Barbara E. Engelhardt
CML
206
316
0
30 Oct 2017
Unbiased Learning-to-Rank with Biased Feedback
Thorsten Joachims
Adith Swaminathan
Tobias Schnabel
CML
75
542
0
16 Aug 2016
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
180
286
0
10 Mar 2015
Cascading Bandits: Learning to Rank in the Cascade Model
Branislav Kveton
Csaba Szepesvári
Zheng Wen
Azin Ashkan
179
284
0
10 Feb 2015
Empirical Bernstein Bounds and Sample Variance Penalization
Andreas Maurer
Massimiliano Pontil
378
542
0
21 Jul 2009
1