Nonparametric MLE for Gaussian Location Mixtures: Certified Computation and Generic Behavior

We study the nonparametric maximum likelihood estimator for Gaussian location mixtures in one dimension. It has been known since (Lindsay, 1983) that given an -point dataset, this estimator always returns a mixture with at most components, and more recently (Wu-Polyanskiy, 2020) gave a sharp bound for subgaussian data. In this work we study computational aspects of . We provide an algorithm which for small enough computes an -approximation of in Wasserstein distance in time . Here is data-dependent but independent of , while is an absolute constant and is the number of atoms in . We also certifiably compute the exact value of in finite time. These guarantees hold almost surely whenever the dataset consists of independent points from a probability distribution with a density (relative to Lebesgue measure). We also show the distribution of conditioned to be -atomic admits a density on the associated dimensional parameter space for all , and almost sure locally linear convergence of the EM algorithm. One key tool is a classical Fourier analytic estimate for non-degenerate curves.
View on arXiv@article{polyanskiy2025_2503.20193, title={ Nonparametric MLE for Gaussian Location Mixtures: Certified Computation and Generic Behavior }, author={ Yury Polyanskiy and Mark Sellke }, journal={arXiv preprint arXiv:2503.20193}, year={ 2025 } }