Semiparametric Contextual Bandits

12 March 2018

Papers citing "Semiparametric Contextual Bandits"

10 / 10 papers shown

Title
Zero-Inflated Bandits Haoyu Wei Runzhe Wan Lei Shi Rui Song 42 0 0 25 Dec 2023
Selective Uncertainty Propagation in Offline RL Sanath Kumar Krishnamurthy Shrey Modi Tanmay Gangwani S. Katariya B. Kveton A. Rangi OffRL 61 0 0 01 Feb 2023
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation Mubarrat Chowdhury Elkhan Ismayilzada Khalequzzaman Sayem Gi-Soo Kim 20 0 0 20 Jan 2023
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach Miao Lu Wenhao Yang Liangyu Zhang Zhihua Zhang OffRL 40 1 0 12 Sep 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles Aldo G. Carranza Sanath Kumar Krishnamurthy Susan Athey 19 1 0 30 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions Nina Deliu Joseph Jay Williams B. Chakraborty OffRL 30 5 0 04 Mar 2022
Bandits with adversarial scaling Thodoris Lykouris Vahab Mirrokni R. Leme 8 14 0 04 Mar 2020
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity Peng Liao Kristjan Greenewald P. Klasnja S. Murphy 17 83 0 08 Sep 2019
Model selection for contextual bandits Dylan J. Foster A. Krishnamurthy Haipeng Luo OffRL 29 89 0 03 Jun 2019
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model Gi-Soo Kim M. Paik 17 14 0 31 Jan 2019