Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations

10 April 2022

Mohamad Kazem Shirani Faradonbeh

Papers citing "Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations"

3 / 3 papers shown

Title
Thompson Sampling in Partially Observable Contextual Bandits Hongju Park Mohamad Kazem Shirani Faradonbeh 28 2 0 15 Feb 2024
Online learning in bandits with predicted context Yongyi Guo Ziping Xu Susan Murphy 26 4 0 26 Jul 2023
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts Hongju Park Mohamad Kazem Shirani Faradonbeh 16 6 0 02 Feb 2022