ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.03327
19
0

Tight Bounds for γγγ-Regret via the Decision-Estimation Coefficient

6 March 2023
Margalit Glasgow
Alexander Rakhlin
    OffRL
ArXivPDFHTML
Abstract

In this work, we give a statistical characterization of the γ\gammaγ-regret for arbitrary structured bandit problems, the regret which arises when comparing against a benchmark that is γ\gammaγ times the optimal solution. The γ\gammaγ-regret emerges in structured bandit problems over a function class F\mathcal{F}F where finding an exact optimum of f∈Ff \in \mathcal{F}f∈F is intractable. Our characterization is given in terms of the γ\gammaγ-DEC, a statistical complexity parameter for the class F\mathcal{F}F, which is a modification of the constrained Decision-Estimation Coefficient (DEC) of Foster et al., 2023 (and closely related to the original offset DEC of Foster et al., 2021). Our lower bound shows that the γ\gammaγ-DEC is a fundamental limit for any model class F\mathcal{F}F: for any algorithm, there exists some f∈Ff \in \mathcal{F}f∈F for which the γ\gammaγ-regret of that algorithm scales (nearly) with the γ\gammaγ-DEC of F\mathcal{F}F. We provide an upper bound showing that there exists an algorithm attaining a nearly matching γ\gammaγ-regret. Due to significant challenges in applying the prior results on the DEC to the γ\gammaγ-regret case, both our lower and upper bounds require novel techniques and a new algorithm.

View on arXiv
Comments on this paper