6
22

Robust Sparse Covariance Estimation by Thresholding Tyler's M-Estimator

Abstract

Estimating a high-dimensional sparse covariance matrix from a limited number of samples is a fundamental problem in contemporary data analysis. Most proposals to date, however, are not robust to outliers or heavy tails. Towards bridging this gap, in this work we consider estimating a sparse shape matrix from nn samples following a possibly heavy tailed elliptical distribution. We propose estimators based on thresholding either Tyler's M-estimator or its regularized variant. We derive bounds on the difference in spectral norm between our estimators and the shape matrix in the joint limit as the dimension pp and sample size nn tend to infinity with p/nγ>0p/n\to\gamma>0. These bounds are minimax rate-optimal. Results on simulated data support our theoretical analysis.

View on arXiv
Comments on this paper