ResearchTrend.AI
Pessimistic Off-Policy Optimization for Learning to Rank

6 June 2022
Matej Cief
Branislav Kveton
Michal Kompan
    OffRL

Papers citing "Pessimistic Off-Policy Optimization for Learning to Rank"

10 / 10 papers shown
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRL
LRM
145
276
0
13 Jun 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
212
289
0
22 Mar 2021
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
140
56
0
25 Jul 2020
Offline Evaluation of Ranking Policies with Click Models
Shuai Li
Yasin Abbasi-Yadkori
Branislav Kveton
S. Muthukrishnan
Vishwa Vinay
Zheng Wen
CML
OffRL
54
67
0
27 Apr 2018
Offline A/B testing for Recommender Systems
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
69
221
0
22 Jan 2018
How Algorithmic Confounding in Recommendation Systems Increases Homogeneity and Decreases Utility
A. Chaney
Brandon M Stewart
Barbara E. Engelhardt
CML
206
316
0
30 Oct 2017
Unbiased Learning-to-Rank with Biased Feedback
Thorsten Joachims
Adith Swaminathan
Tobias Schnabel
CML
75
542
0
16 Aug 2016
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
180
286
0
10 Mar 2015
Cascading Bandits: Learning to Rank in the Cascade Model
Branislav Kveton
Csaba Szepesvári
Zheng Wen
Azin Ashkan
179
284
0
10 Feb 2015
Empirical Bernstein Bounds and Sample Variance Penalization
Andreas Maurer
Massimiliano Pontil
378
542
0
21 Jul 2009