ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.08101
14
9

The Sparse Hausdorff Moment Problem, with Application to Topic Models

16 July 2020
Spencer Gordon
Bijan Mazaheri
Leonard J. Schulman
Y. Rabani
ArXivPDFHTML
Abstract

We consider the problem of identifying, from its first mmm noisy moments, a probability distribution on [0,1][0,1][0,1] of support k<∞k<\inftyk<∞. This is equivalent to the problem of learning a distribution on mmm observable binary random variables X1,X2,…,XmX_1,X_2,\dots,X_mX1​,X2​,…,Xm​ that are iid conditional on a hidden random variable UUU taking values in {1,2,…,k}\{1,2,\dots,k\}{1,2,…,k}. Our focus is on accomplishing this with m=2km=2km=2k, which is the minimum mmm for which verifying that the source is a kkk-mixture is possible (even with exact statistics). This problem, so simply stated, is quite useful: e.g., by a known reduction, any algorithm for it lifts to an algorithm for learning pure topic models. We give an algorithm for identifying a kkk-mixture using samples of m=2km=2km=2k iid binary random variables using a sample of size (1/wmin⁡)2⋅(1/ζ)O(k)\left(1/w_{\min}\right)^2 \cdot\left(1/\zeta\right)^{O(k)}(1/wmin​)2⋅(1/ζ)O(k) and post-sampling runtime of only O(k2+o(1))O(k^{2+o(1)})O(k2+o(1)) arithmetic operations. Here wmin⁡w_{\min}wmin​ is the minimum probability of an outcome of UUU, and ζ\zetaζ is the minimum separation between the distinct success probabilities of the XiX_iXi​s. Stated in terms of the moment problem, it suffices to know the moments to additive accuracy wmin⁡⋅ζO(k)w_{\min}\cdot\zeta^{O(k)}wmin​⋅ζO(k). It is known that the sample complexity of any solution to the identification problem must be at least exponential in kkk. Previous results demonstrated either worse sample complexity and worse O(kc)O(k^c)O(kc) runtime for some ccc substantially larger than 222, or similar sample complexity and much worse kO(k2)k^{O(k^2)}kO(k2) runtime.

View on arXiv
Comments on this paper