ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.10404
24
1

Deterministic Apple Tasting

14 October 2024
Zachary Chase
Idan Mehalel
ArXivPDFHTML
Abstract

In binary (0/10/10/1) online classification with apple tasting feedback, the learner receives feedback only when predicting 111. Besides some degenerate learning tasks, all previously known learning algorithms for this model are randomized. Consequently, prior to this work it was unknown whether deterministic apple tasting is generally feasible. In this work, we provide the first widely-applicable deterministic apple tasting learner, and show that in the realizable case, a hypothesis class is learnable if and only if it is deterministically learnable, confirming a conjecture of [Raman, Subedi, Raman, Tewari-24]. Quantitatively, we show that every class H\mathcal{H}H is learnable with mistake bound O(L(H)Tlog⁡T)O \left(\sqrt{\mathtt{L}(\mathcal{H}) T \log T} \right)O(L(H)TlogT​) (where L(H)\mathtt{L}(\mathcal{H})L(H) is the Littlestone dimension of H\mathcal{H}H), and that this is tight for some classes. We further study the agnostic case, in which the best hypothesis makes at most kkk many mistakes, and prove a trichotomy stating that every class H\mathcal{H}H must be either easy, hard, or unlearnable. Easy classes have (both randomized and deterministic) mistake bound ΘH(k)\Theta_{\mathcal{H}}(k)ΘH​(k). Hard classes have randomized mistake bound Θ~H(k+T)\tilde{\Theta}_{\mathcal{H}} \left(k + \sqrt{T} \right)Θ~H​(k+T​), and deterministic mistake bound Θ~H(k⋅T)\tilde{\Theta}_{\mathcal{H}} \left(\sqrt{k \cdot T} \right)Θ~H​(k⋅T​), where TTT is the time horizon. Unlearnable classes have (both randomized and deterministic) mistake bound Θ(T)\Theta(T)Θ(T). Our upper bound is based on a deterministic algorithm for learning from expert advice with apple tasting feedback, a problem interesting in its own right. For this problem, we show that the optimal deterministic mistake bound is Θ(T(k+log⁡n))\Theta \left(\sqrt{T (k + \log n)} \right)Θ(T(k+logn)​) for all kkk and T≤n≤2TT \leq n \leq 2^TT≤n≤2T, where nnn is the number of experts.

View on arXiv
Comments on this paper