Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations

10 April 2022
Hongju Park
Mohamad Kazem Shirani Faradonbeh
    OffRL

Papers citing "Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations"

3 / 3 papers shown
Thompson Sampling in Partially Observable Contextual Bandits
Hongju Park
Mohamad Kazem Shirani Faradonbeh
15 Feb 2024
Online learning in bandits with predicted context
Yongyi Guo
Ziping Xu
Susan Murphy
26 Jul 2023
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts
Hongju Park
Mohamad Kazem Shirani Faradonbeh
02 Feb 2022