ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.18915
81
1
v1v2 (latest)

Testing Support Size More Efficiently Than Learning Histograms

24 October 2024
Renato Ferreira Pinto Jr.
Nathaniel Harms
ArXiv (abs)PDFHTML
Abstract

Consider two problems about an unknown probability distribution ppp:1. How many samples from ppp are required to test if ppp is supported on nnn elements or not? Specifically, given samples from ppp, determine whether it is supported on at most nnn elements, or it is "ϵ\epsilonϵ-far" (in total variation distance) from being supported on nnn elements.2. Given mmm samples from ppp, what is the largest lower bound on its support size that we can produce?The best known upper bound for problem (1) uses a general algorithm for learning the histogram of the distribution ppp, which requires Θ(nϵ2log⁡n)\Theta(\tfrac{n}{\epsilon^2 \log n})Θ(ϵ2lognn​) samples. We show that testing can be done more efficiently than learning the histogram, using only O(nϵlog⁡nlog⁡(1/ϵ))O(\tfrac{n}{\epsilon \log n} \log(1/\epsilon))O(ϵlognn​log(1/ϵ)) samples, nearly matching the best known lower bound of Ω(nϵlog⁡n)\Omega(\tfrac{n}{\epsilon \log n})Ω(ϵlognn​). This algorithm also provides a better solution to problem (2), producing larger lower bounds on support size than what follows from previous work. The proof relies on an analysis of Chebyshev polynomial approximations outside the range where they are designed to be good approximations, and the paper is intended as an accessible self-contained exposition of the Chebyshev polynomial method.

View on arXiv
@article{jr.2025_2410.18915,
  title={ Testing Support Size More Efficiently Than Learning Histograms },
  author={ Renato Ferreira Pinto Jr. and Nathaniel Harms },
  journal={arXiv preprint arXiv:2410.18915},
  year={ 2025 }
}
Comments on this paper