ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.12175
  4. Cited By
Analysis of Thompson Sampling for Partially Observable Contextual
  Multi-Armed Bandits

Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits

23 October 2021
Yash J. Patel
Mohamad Kazem Shirani Faradonbeh
ArXivPDFHTML

Papers citing "Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits"

9 / 9 papers shown
Title
Partially Observable Contextual Bandits with Linear Payoffs
Partially Observable Contextual Bandits with Linear Payoffs
Sihan Zeng
Sujay Bhatt
Alec Koppel
Sumitra Ganesh
OffRL
24
1
0
17 Sep 2024
Thompson Sampling in Partially Observable Contextual Bandits
Thompson Sampling in Partially Observable Contextual Bandits
Hongju Park
Mohamad Kazem Shirani Faradonbeh
31
2
0
15 Feb 2024
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An
  Information-Theoretic Regret Analysis
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis
Sharu Theresa Jose
Shana Moothedath
30
2
0
21 Jan 2024
Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints
Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints
Jiabin Lin
Shana Moothedath
45
1
0
21 Jan 2024
Online Decision Mediation
Online Decision Mediation
Daniel Jarrett
Alihan Huyuk
M. Schaar
35
2
0
28 Oct 2023
Follow-ups Also Matter: Improving Contextual Bandits via Post-serving
  Contexts
Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts
Chaoqi Wang
Ziyu Ye
Zhe Feng
Ashwinkumar Badanidiyuru
Haifeng Xu
31
1
0
25 Sep 2023
Worst-case Performance of Greedy Policies in Bandits with Imperfect
  Context Observations
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations
Hongju Park
Mohamad Kazem Shirani Faradonbeh
OffRL
29
2
0
10 Apr 2022
Remote Contextual Bandits
Remote Contextual Bandits
Francesco Pase
Deniz Gunduz
M. Zorzi
21
5
0
10 Feb 2022
Efficient Algorithms for Learning to Control Bandits with Unobserved
  Contexts
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts
Hongju Park
Mohamad Kazem Shirani Faradonbeh
21
6
0
02 Feb 2022
1