Improving Offline Contextual Bandits with Distributional Robustness

13 November 2020

Papers citing "Improving Offline Contextual Bandits with Distributional Robustness"


Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning | Otmane Sakhi, Imad Aouali, Pierre Alquier, Nicolas Chopin | OffRL | 43 1 0 | 23 May 2024
Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization | Ramnath Kumar, Kushal Majmundar, Dheeraj M. Nagaraj, A. Suggala | ODL | 32 6 0 | 15 Jun 2023
PAC-Bayesian Offline Contextual Bandits With Guarantees | Otmane Sakhi, Pierre Alquier, Nicolas Chopin | OffRL | 29 12 0 | 24 Oct 2022
Fast Offline Policy Optimization for Large Scale Recommendation | Otmane Sakhi, D. Rohde, Alexandre Gilotte | OffRL | 45 3 0 | 08 Aug 2022