Affine Invariant Covariance Estimation for Heavy-Tailed Distributions
In this work we provide an estimator for the covariance matrix of a heavy-tailed random vector. We prove that the proposed estimator admits \textit{affine-invariant} bounds of the form in high probability, where is the unknown covariance matrix, and is the positive semidefinite order on symmetric matrices. The result only requires the existence of fourth-order moments, and allows for where is some measure of kurtosis of the distribution, is the dimensionality of the space, and is the sample size. More generally, we can allow for regularization with level~, then depends on the degrees of freedom number which is generally smaller than . The computational cost of the proposed estimator is essentially~, comparable to the computational cost of the sample covariance matrix in the statistically interesting regime~. Its applications to eigenvalue estimation with relative error and to ridge regression with heavy-tailed random design are discussed.
View on arXiv