Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.07030
Cited By
Offline A/B testing for Recommender Systems
22 January 2018
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Offline A/B testing for Recommender Systems"
28 / 28 papers shown
Title
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
86
0
0
03 Apr 2025
Balancing Immediate Revenue and Future Off-Policy Evaluation in Coupon Allocation
Naoki Nishimura
Ken Kobayashi
Kazuhide Nakata
OffRL
30
0
0
06 Jul 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
41
5
0
22 Feb 2024
Reinforcement Learning from Statistical Feedback: the Journey from AB Testing to ANT Testing
Feiyang Han
Yimin Wei
Zhaofeng Liu
Yanxing Qi
43
1
0
24 Nov 2023
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-
n
n
n
Recommendation
Olivier Jeunen
Ivan Potapov
Aleksei Ustimenko
ELM
OffRL
32
11
0
27 Jul 2023
Correcting for Interference in Experiments: A Case Study at Douyin
Vivek F. Farias
Hao Li
Tianyi Peng
Xinyuyang Ren
B. Hassibi
A. Zheng
41
9
0
04 May 2023
Uncertainty Calibration for Counterfactual Propensity Estimation in Recommendation
Wenbo Hu
Xin Sun
Qiang liu
Wenbo Hu
Shu Wu
47
0
0
23 Mar 2023
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning
Carolin Schmidt
Daniele Gammelli
Francisco Câmara Pereira
Filipe Rodrigues
OffRL
24
4
0
28 Feb 2023
Scalable End-to-End ML Platforms: from AutoML to Self-serve
I. Markov
P. Apostolopoulos
Mia Garrard
Tianyu Qie
Yin Huang
...
Anika Li
Cesar Cardoso
George Han
Ryan Maghsoudian
Norm Zhou
LRM
31
2
0
27 Feb 2023
Recommender Systems: A Primer
P. Castells
Dietmar Jannach
OffRL
34
5
0
06 Feb 2023
Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model
Alexander K. Buchholz
Ben London
Giuseppe Di Benedetto
Thorsten Joachims
OffRL
26
2
0
15 Oct 2022
Simpson's Paradox in Recommender Fairness: Reconciling differences between per-user and aggregated evaluations
Flavien Prost
Ben Packer
Jilin Chen
Li Wei
Pierre-Antoine Kremp
...
Tulsee Doshi
Tonia Osadebe
Lukasz Heldt
Ed H. Chi
Alex Beutel
30
5
0
14 Oct 2022
Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation
Imad Aouali
Amine Benhalloum
Martin Bompaire
Benjamin Heymann
Olivier Jeunen
D. Rohde
Otmane Sakhi
Flavian Vasile
OffRL
11
2
0
18 Sep 2022
Inverse Propensity Score based offline estimator for deterministic ranking lists using position bias
Nick Wood
Sumit Sidana
OffRL
14
0
0
31 Aug 2022
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Wanqi Xue
Qingpeng Cai
Ruohan Zhan
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
38
24
0
01 Jun 2022
KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems
Chongming Gao
Shijun Li
Wenqiang Lei
Jiawei Chen
Biao Li
Peng Jiang
Xiangnan He
Jiaxin Mao
Tat-Seng Chua
37
131
0
22 Feb 2022
Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model
Haruka Kiyohara
Yuta Saito
Tatsuya Matsuhiro
Yusuke Narita
N. Shimizu
Yasuo Yamamoto
OffRL
26
42
0
03 Feb 2022
Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the Theoretical Perspectives
Zida Cheng
Chuanwei Ruan
Siheng Chen
Sushant Kumar
Ya Zhang
27
16
0
23 Oct 2021
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
35
100
0
30 Mar 2021
Understanding the role of importance weighting for deep learning
Da Xu
Yuting Ye
Chuanwei Ruan
FAtt
39
43
0
28 Mar 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
138
273
0
23 Jan 2021
Split-Treatment Analysis to Rank Heterogeneous Causal Effects for Prospective Interventions
Yanbo Xu
Divyat Mahajan
Liz Manrao
Amit Sharma
Emre Kıcıman
CML
15
2
0
11 Nov 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
24
73
0
17 Aug 2020
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Dalin Guo
S. Ktena
Ferenc Huszár
Pranay K. Myana
Wenzhe Shi
Alykhan Tejani
OffRL
38
40
0
03 Aug 2020
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
121
55
0
25 Jul 2020
Toward Simulating Environments in Reinforcement Learning Based Recommendations
Xiangyu Zhao
Long Xia
Zhuoye Ding
Dawei Yin
Jiliang Tang
30
25
0
27 Jun 2019
Top-K Off-Policy Correction for a REINFORCE Recommender System
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML
OffRL
33
474
0
06 Dec 2018
Offline Evaluation of Ranking Policies with Click Models
Shuai Li
Yasin Abbasi-Yadkori
Branislav Kveton
S. Muthukrishnan
Vishwa Vinay
Zheng Wen
CML
OffRL
10
65
0
27 Apr 2018
1