ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.05378
99
5

Feature and Parameter Selection in Stochastic Linear Bandits

9 June 2021
Ahmadreza Moradipari
Berkay Turan
Yasin Abbasi-Yadkori
M. Alizadeh
Mohammad Ghavamzadeh
ArXivPDFHTML
Abstract

We study two model selection settings in stochastic linear bandits (LB). In the first setting, which we refer to as feature selection, the expected reward of the LB problem is in the linear span of at least one of MMM feature maps (models). In the second setting, the reward parameter of the LB problem is arbitrarily selected from MMM models represented as (possibly) overlapping balls in Rd\mathbb R^dRd. However, the agent only has access to misspecified models, i.e.,~estimates of the centers and radii of the balls. We refer to this setting as parameter selection. For each setting, we develop and analyze a computationally efficient algorithm that is based on a reduction from bandits to full-information problems. This allows us to obtain regret bounds that are not worse (up to a log⁡M\sqrt{\log M}logM​ factor) than the case where the true model is known. This is the best-reported dependence on the number of models MMM in these settings. Finally, we empirically show the effectiveness of our algorithms using synthetic and real-world experiments.

View on arXiv
Comments on this paper