arXiv:1906.03794

The Broad Optimality of Profile Maximum Likelihood

10 June 2019
Yi Hao
A. Orlitsky
Abstract

We study three fundamental statistical-learning problems: distribution estimation, property estimation, and property testing. We establish the profile maximum likelihood (PML) estimator as the first unified sample-optimal approach to a wide range of learning tasks. In particular, for every alphabet size $k$ and desired accuracy $\varepsilon$:

Distribution estimation. Under $\ell_1$ distance, PML yields optimal $\Theta(k/(\varepsilon^2\log k))$ sample complexity for sorted-distribution estimation, and a PML-based estimator empirically outperforms the Good-Turing estimator on the actual distribution;

Additive property estimation. For a broad class of additive properties, the PML plug-in estimator uses just four times the sample size required by the best estimator to achieve roughly twice its error, with exponentially higher confidence;

$\alpha$-Rényi entropy estimation. For integer $\alpha>1$, the PML plug-in estimator has optimal $k^{1-1/\alpha}$ sample complexity; for non-integer $\alpha>3/4$, the PML plug-in estimator has sample complexity lower than the state of the art;

Identity testing. In testing whether an unknown distribution is equal to or at least $\varepsilon$ far from a given distribution in $\ell_1$ distance, a PML-based tester achieves the optimal sample complexity up to logarithmic factors of $k$.

Most of these results also hold for a near-linear-time computable variant of PML. Stronger results hold for a different and novel variant called truncated PML (TPML).
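
The abstract does not define its terms, so the following is only a minimal illustrative sketch under standard definitions, not the authors' algorithm. It assumes that the profile of a sample is the multiset of symbol multiplicities with labels ignored, that PML picks the distribution maximizing the probability of the observed profile, and that a plug-in estimator evaluates the property on the estimated distribution. The function names, the brute-force enumeration, and the restriction to two-symbol alphabets in the grid search are all hypothetical choices made to keep the example tiny.

```python
from collections import Counter
from itertools import product
import math


def profile(sample):
    """Multiset of symbol multiplicities, as a sorted tuple (labels ignored)."""
    return tuple(sorted(Counter(sample).values(), reverse=True))


def profile_probability(p, target, n):
    """Probability that n i.i.d. draws from p have profile `target`.

    Brute force over all len(p)**n sequences, so only usable for toy sizes.
    """
    total = 0.0
    for seq in product(range(len(p)), repeat=n):
        if profile(seq) == target:
            total += math.prod(p[i] for i in seq)
    return total


def binary_pml_grid(target, steps=100):
    """Grid search for the profile-likelihood maximizer over (t, 1 - t).

    Assumption: the search is restricted to two-symbol distributions,
    which is a simplification of PML made for illustration only.
    """
    n = sum(target)
    best_t, best_prob = 0.0, -1.0
    for j in range(steps + 1):
        t = j / steps
        prob = profile_probability((t, 1.0 - t), target, n)
        if prob > best_prob:
            best_t, best_prob = t, prob
    return best_t, best_prob


def renyi_entropy(p, alpha):
    """alpha-Renyi entropy in nats, for alpha > 0 and alpha != 1."""
    return math.log(sum(q ** alpha for q in p if q > 0)) / (1.0 - alpha)


sample = "aab"
phi = profile(sample)                    # one symbol seen twice, one seen once
t_hat, likelihood = binary_pml_grid(phi)
pml_estimate = (t_hat, 1.0 - t_hat)
plug_in_entropy = renyi_entropy(pml_estimate, alpha=2)
```

For the sample "aab" the empirical distribution is (2/3, 1/3), whereas the profile likelihood over two symbols, 3t(1 - t), peaks at the uniform distribution. This shows in miniature how a profile-based estimate can differ from the empirical one before a property such as Rényi entropy is plugged in.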
