arXiv:1612.03375

Sample complexity of the distinct elements problem

11 December 2016
Yihong Wu
Pengkun Yang
Abstract

We consider the distinct elements problem, where the goal is to estimate the number of distinct colors in an urn containing $k$ balls based on $n$ samples drawn with replacement. Based on discrete polynomial approximation and interpolation, we propose an estimator with additive error guarantee that achieves the optimal sample complexity within $O(\log\log k)$ factors, and in fact within constant factors for most cases. The estimator can be computed in $O(n)$ time for an accurate estimation. The result also applies to sampling without replacement provided the sample size is a vanishing fraction of the urn size. One of the key auxiliary results is a sharp bound on the minimum singular values of a real rectangular Vandermonde matrix, which might be of independent interest.
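
To make the sampling setup concrete, here is a minimal Python sketch of the problem. It is not the estimator proposed in the paper: it only simulates an urn and applies the naive plug-in estimate (the number of distinct colors observed in the sample), which is a baseline that never overestimates the truth. The helper names `make_urn` and `plug_in_estimate`, the urn size, and the number of colors are all assumptions chosen for illustration.

```python
import random


def make_urn(k, num_colors, seed=0):
    """Hypothetical helper: build an urn of k balls using exactly num_colors colors."""
    rng = random.Random(seed)
    # One ball per color guarantees every color is present;
    # the remaining k - num_colors balls get colors chosen at random.
    balls = list(range(num_colors))
    balls += [rng.randrange(num_colors) for _ in range(k - num_colors)]
    rng.shuffle(balls)
    return balls


def plug_in_estimate(urn, n, seed=0):
    """Draw n balls with replacement and count the distinct colors observed.

    This is the naive baseline, not the polynomial-based estimator
    described in the abstract.
    """
    rng = random.Random(seed)
    sample = [rng.choice(urn) for _ in range(n)]
    return len(set(sample))


# Illustrative values (assumptions): k = 1000 balls, 300 distinct colors.
urn = make_urn(k=1000, num_colors=300)
true_count = len(set(urn))

for n in (100, 1000, 10000):
    print(f"n={n}: observed {plug_in_estimate(urn, n)} of {true_count} colors")
```

With few samples the plug-in count falls well short of the true number of colors, which is the gap that a more refined estimator with an additive error guarantee aims to close using fewer samples.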
