Post-Contextual-Bandit Inference

Post-Contextual-Bandit Inference

1 June 2021

Aurélien F. Bibaut

Antoine Chambaz

Maria Dimakopoulou

Mark van der Laan

Papers citing "Post-Contextual-Bandit Inference"

10 / 10 papers shown

Title
SNPL: Simultaneous Policy Learning and Evaluation for Safe Multi-Objective Policy Improvement Brian Cho Ana-Roxana Pop Ariel Evince Nathan Kallus OffRL 51 0 0 17 Mar 2025
Statistical Inference in Reinforcement Learning: A Selective Survey Chengchun Shi OffRL 74 1 0 22 Feb 2025
Inference with the Upper Confidence Bound Algorithm K. Khamaru Cun-Hui Zhang 53 0 0 08 Aug 2024
Online learning in bandits with predicted context Yongyi Guo Ziping Xu Susan Murphy 26 4 0 26 Jul 2023
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling Susobhan Ghosh Raphael Kim Prasidh Chhabria Raaz Dwivedi Predrag Klasjna Peng Liao Kelly Zhang Susan Murphy OffRL 38 8 0 11 Apr 2023
Near-Optimal Non-Parametric Sequential Tests and Confidence Sequences with Possibly Dependent Observations Aurélien F. Bibaut Nathan Kallus Michael Lindon 35 9 0 29 Dec 2022
Anytime-valid off-policy inference for contextual bandits Ian Waudby-Smith Lili Wu Aaditya Ramdas Nikos Karampatziakis Paul Mineiro OffRL 45 25 0 19 Oct 2022
Best Arm Identification with Contextual Information under a Small Gap Masahiro Kato Masaaki Imaizumi Takuya Ishihara T. Kitagawa 27 2 0 15 Sep 2022
Entropy Regularization for Population Estimation Ben Chugg Peter Henderson Jacob Goldin Daniel E. Ho 33 3 0 24 Aug 2022
OpenML Benchmarking Suites B. Bischl Giuseppe Casalicchio Matthias Feurer Pieter Gijsbers Frank Hutter Michel Lang R. G. Mantovani Jan N. van Rijn Joaquin Vanschoren VLM ELM 43 152 0 11 Aug 2017