66
0

Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination

Abstract

We study Gaussian sparse estimation tasks in Huber's contamination model with a focus on mean estimation, PCA, and linear regression. For each of these tasks, we give the first sample and computationally efficient robust estimators with optimal error guarantees, within constant factors. All prior efficient algorithms for these tasks incur quantitatively suboptimal error. Concretely, for Gaussian robust kk-sparse mean estimation on Rd\mathbb{R}^d with corruption rate ϵ>0\epsilon>0, our algorithm has sample complexity (k2/ϵ2)polylog(d/ϵ)(k^2/\epsilon^2)\mathrm{polylog}(d/\epsilon), runs in sample polynomial time, and approximates the target mean within 2\ell_2-error O(ϵ)O(\epsilon). Previous efficient algorithms inherently incur error Ω(ϵlog(1/ϵ))\Omega(\epsilon \sqrt{\log(1/\epsilon)}). At the technical level, we develop a novel multidimensional filtering method in the sparse regime that may find other applications.

View on arXiv
Comments on this paper

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.