ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.07030
  4. Cited By
Offline A/B testing for Recommender Systems

Offline A/B testing for Recommender Systems

22 January 2018
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
    OffRL
ArXivPDFHTML

Papers citing "Offline A/B testing for Recommender Systems"

28 / 28 papers shown
Title
Prompt Optimization with Logged Bandit Data
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
86
0
0
03 Apr 2025
Balancing Immediate Revenue and Future Off-Policy Evaluation in Coupon
  Allocation
Balancing Immediate Revenue and Future Off-Policy Evaluation in Coupon Allocation
Naoki Nishimura
Ken Kobayashi
Kazuhide Nakata
OffRL
30
0
0
06 Jul 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
41
5
0
22 Feb 2024
Reinforcement Learning from Statistical Feedback: the Journey from AB
  Testing to ANT Testing
Reinforcement Learning from Statistical Feedback: the Journey from AB Testing to ANT Testing
Feiyang Han
Yimin Wei
Zhaofeng Liu
Yanxing Qi
43
1
0
24 Nov 2023
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation
  Metric for Top-$n$ Recommendation
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-nnn Recommendation
Olivier Jeunen
Ivan Potapov
Aleksei Ustimenko
ELM
OffRL
32
11
0
27 Jul 2023
Correcting for Interference in Experiments: A Case Study at Douyin
Correcting for Interference in Experiments: A Case Study at Douyin
Vivek F. Farias
Hao Li
Tianyi Peng
Xinyuyang Ren
B. Hassibi
A. Zheng
41
9
0
04 May 2023
Uncertainty Calibration for Counterfactual Propensity Estimation in Recommendation
Uncertainty Calibration for Counterfactual Propensity Estimation in Recommendation
Wenbo Hu
Xin Sun
Qiang liu
Wenbo Hu
Shu Wu
47
0
0
23 Mar 2023
Learning to Control Autonomous Fleets from Observation via Offline
  Reinforcement Learning
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning
Carolin Schmidt
Daniele Gammelli
Francisco Câmara Pereira
Filipe Rodrigues
OffRL
24
4
0
28 Feb 2023
Scalable End-to-End ML Platforms: from AutoML to Self-serve
Scalable End-to-End ML Platforms: from AutoML to Self-serve
I. Markov
P. Apostolopoulos
Mia Garrard
Tianyu Qie
Yin Huang
...
Anika Li
Cesar Cardoso
George Han
Ryan Maghsoudian
Norm Zhou
LRM
31
2
0
27 Feb 2023
Recommender Systems: A Primer
Recommender Systems: A Primer
P. Castells
Dietmar Jannach
OffRL
34
5
0
06 Feb 2023
Off-policy evaluation for learning-to-rank via interpolating the
  item-position model and the position-based model
Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model
Alexander K. Buchholz
Ben London
Giuseppe Di Benedetto
Thorsten Joachims
OffRL
26
2
0
15 Oct 2022
Simpson's Paradox in Recommender Fairness: Reconciling differences
  between per-user and aggregated evaluations
Simpson's Paradox in Recommender Fairness: Reconciling differences between per-user and aggregated evaluations
Flavien Prost
Ben Packer
Jilin Chen
Li Wei
Pierre-Antoine Kremp
...
Tulsee Doshi
Tonia Osadebe
Lukasz Heldt
Ed H. Chi
Alex Beutel
30
5
0
14 Oct 2022
Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of
  Simulation
Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation
Imad Aouali
Amine Benhalloum
Martin Bompaire
Benjamin Heymann
Olivier Jeunen
D. Rohde
Otmane Sakhi
Flavian Vasile
OffRL
11
2
0
18 Sep 2022
Inverse Propensity Score based offline estimator for deterministic
  ranking lists using position bias
Inverse Propensity Score based offline estimator for deterministic ranking lists using position bias
Nick Wood
Sumit Sidana
OffRL
14
0
0
31 Aug 2022
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation
  with Residual Actor
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Wanqi Xue
Qingpeng Cai
Ruohan Zhan
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
38
24
0
01 Jun 2022
KuaiRec: A Fully-observed Dataset and Insights for Evaluating
  Recommender Systems
KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems
Chongming Gao
Shijun Li
Wenqiang Lei
Jiawei Chen
Biao Li
Peng Jiang
Xiangnan He
Jiaxin Mao
Tat-Seng Chua
37
131
0
22 Feb 2022
Doubly Robust Off-Policy Evaluation for Ranking Policies under the
  Cascade Behavior Model
Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model
Haruka Kiyohara
Yuta Saito
Tatsuya Matsuhiro
Yusuke Narita
N. Shimizu
Yasuo Yamamoto
OffRL
26
42
0
03 Feb 2022
Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the
  Theoretical Perspectives
Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the Theoretical Perspectives
Zida Cheng
Chuanwei Ruan
Siheng Chen
Sushant Kumar
Ya Zhang
27
16
0
23 Oct 2021
Benchmarks for Deep Off-Policy Evaluation
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
35
100
0
30 Mar 2021
Understanding the role of importance weighting for deep learning
Understanding the role of importance weighting for deep learning
Da Xu
Yuting Ye
Chuanwei Ruan
FAtt
39
43
0
28 Mar 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
138
273
0
23 Jan 2021
Split-Treatment Analysis to Rank Heterogeneous Causal Effects for
  Prospective Interventions
Split-Treatment Analysis to Rank Heterogeneous Causal Effects for Prospective Interventions
Yanbo Xu
Divyat Mahajan
Liz Manrao
Amit Sharma
Emre Kıcıman
CML
15
2
0
11 Nov 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible
  Off-Policy Evaluation
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
24
73
0
17 Aug 2020
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Dalin Guo
S. Ktena
Ferenc Huszár
Pranay K. Myana
Wenzhe Shi
Alykhan Tejani
OffRL
38
40
0
03 Aug 2020
Counterfactual Evaluation of Slate Recommendations with Sequential
  Reward Interactions
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
121
55
0
25 Jul 2020
Toward Simulating Environments in Reinforcement Learning Based
  Recommendations
Toward Simulating Environments in Reinforcement Learning Based Recommendations
Xiangyu Zhao
Long Xia
Zhuoye Ding
Dawei Yin
Jiliang Tang
30
25
0
27 Jun 2019
Top-K Off-Policy Correction for a REINFORCE Recommender System
Top-K Off-Policy Correction for a REINFORCE Recommender System
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML
OffRL
33
474
0
06 Dec 2018
Offline Evaluation of Ranking Policies with Click Models
Offline Evaluation of Ranking Policies with Click Models
Shuai Li
Yasin Abbasi-Yadkori
Branislav Kveton
S. Muthukrishnan
Vishwa Vinay
Zheng Wen
CML
OffRL
10
65
0
27 Apr 2018
1