2008.07146
Cited By
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
17 August 2020
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
Papers citing "Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation"
14 / 14 papers shown
Title
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
64
0
0
03 Apr 2025
AExGym: Benchmarks and Environments for Adaptive Experimentation
Jimmy Wang
Ethan Che
Daniel R. Jiang
Hongseok Namkoong
42
0
0
08 Aug 2024
Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It
Yuta Saito
Masahiro Nomura
OffRL
50
2
0
23 Apr 2024
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
44
0
0
27 Aug 2023
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-n Recommendation
Olivier Jeunen
Ivan Potapov
Aleksei Ustimenko
ELM
OffRL
27
11
0
27 Jul 2023
A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits
Liu Leqi
Giulio Zhou
Fatma Kılınç-Karzan
Zachary Chase Lipton
A. Montgomery
21
2
0
16 Apr 2023
Situating Recommender Systems in Practice: Towards Inductive Learning and Incremental Updates
Tobias Schnabel
Mengting Wan
Longqi Yang
HAI
27
8
0
11 Nov 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems
Chongming Gao
Shijun Li
Wenqiang Lei
Jiawei Chen
Biao Li
Peng Jiang
Xiangnan He
Jiaxin Mao
Tat-Seng Chua
32
131
0
22 Feb 2022
Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model
Haruka Kiyohara
Yuta Saito
Tatsuya Matsuhiro
Yusuke Narita
N. Shimizu
Yasuo Yamamoto
OffRL
24
42
0
03 Feb 2022
Off-Policy Evaluation Using Information Borrowing and Context-Based Switching
Sutanoy Dasgupta
Yabo Niu
Kishan Panaganti
D. Kalathil
D. Pati
Bani Mallick
OffRL
29
0
0
18 Dec 2021
A Practical Guide of Off-Policy Evaluation for Bandit Problems
Masahiro Kato
Kenshi Abe
Kaito Ariu
Shota Yasui
OffRL
14
3
0
23 Oct 2020
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
118
55
0
25 Jul 2020
Benchmarking Graph Neural Networks
Vijay Prakash Dwivedi
Chaitanya K. Joshi
Anh Tuan Luu
T. Laurent
Yoshua Bengio
Xavier Bresson
189
917
0
02 Mar 2020