ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.09024
21
9

Statistically Optimal Robust Mean and Covariance Estimation for Anisotropic Gaussians

21 January 2023
A. Minasyan
Nikita Zhivotovskiy
ArXivPDFHTML
Abstract

Assume that X1,…,XNX_{1}, \ldots, X_{N}X1​,…,XN​ is an ε\varepsilonε-contaminated sample of NNN independent Gaussian vectors in Rd\mathbb{R}^dRd with mean μ\muμ and covariance Σ\SigmaΣ. In the strong ε\varepsilonε-contamination model we assume that the adversary replaced an ε\varepsilonε fraction of vectors in the original Gaussian sample by any other vectors. We show that there is an estimator μ^\widehat \muμ​ of the mean satisfying, with probability at least 1−δ1 - \delta1−δ, a bound of the form \[ \|\widehat{\mu} - \mu\|_2 \le c\left(\sqrt{\frac{\operatorname{Tr}(\Sigma)}{N}} + \sqrt{\frac{\|\Sigma\|\log(1/\delta)}{N}} + \varepsilon\sqrt{\|\Sigma\|}\right), \] where c>0c > 0c>0 is an absolute constant and ∥Σ∥\|\Sigma\|∥Σ∥ denotes the operator norm of Σ\SigmaΣ. In the same contaminated Gaussian setup, we construct an estimator Σ^\widehat \SigmaΣ of the covariance matrix Σ\SigmaΣ that satisfies, with probability at least 1−δ1 - \delta1−δ, \[ \left\|\widehat{\Sigma} - \Sigma\right\| \le c\left(\sqrt{\frac{\|\Sigma\|\operatorname{Tr}(\Sigma)}{N}} + \|\Sigma\|\sqrt{\frac{\log(1/\delta)}{N}} + \varepsilon\|\Sigma\|\right). \] Both results are optimal up to multiplicative constant factors. Despite the recent significant interest in robust statistics, achieving both dimension-free bounds in the canonical Gaussian case remained open. In fact, several previously known results were either dimension-dependent and required Σ\SigmaΣ to be close to identity, or had a sub-optimal dependence on the contamination level ε\varepsilonε. As a part of the analysis, we derive sharp concentration inequalities for central order statistics of Gaussian, folded normal, and chi-squared distributions.

View on arXiv
Comments on this paper