ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.05794
15
1

Bounds on the price of feedback for mistake-bounded online learning

11 January 2024
Jesse Geneson
Linus Tang
ArXivPDFHTML
Abstract

We improve several worst-case bounds for various online learning scenarios from (Auer and Long, Machine Learning, 1999). In particular, we sharpen an upper bound for delayed ambiguous reinforcement learning by a factor of 2 and an upper bound for learning compositions of families of functions by a factor of 2.41. We also improve a lower bound from the same paper for learning compositions of kkk families of functions by a factor of Θ(ln⁡k)\Theta(\ln{k})Θ(lnk), matching the upper bound up to a constant factor. In addition, we solve a problem from (Long, Theoretical Computer Science, 2020) on the price of bandit feedback with respect to standard feedback for multiclass learning, and we improve an upper bound from (Feng et al., Theoretical Computer Science, 2023) on the price of rrr-input delayed ambiguous reinforcement learning by a factor of rrr, matching a lower bound from the same paper up to the leading term.

View on arXiv
Comments on this paper