16
7

Exact spectral norm error of sample covariance

Abstract

Let X1,,XnX_1,\ldots,X_n be i.i.d. centered Gaussian vectors in Rp\mathbb{R}^p with covariance Σ\Sigma, and let Σ^n1i=1nXiXi\hat{\Sigma}\equiv n^{-1}\sum_{i=1}^n X_iX_i^\top be the sample covariance. A central object of interest in the non-asymptotic theory of sample covariance is the spectral norm error Σ^Σ||\hat{\Sigma}-\Sigma|| of the sample covariance Σ^\hat{\Sigma}. In the path-breaking work of Koltchinskii and Lounici [KL17a], the `zeroth-order' magnitude of Σ^Σ||\hat{\Sigma}-\Sigma|| is characterized by the dimension-free two-sided estimate E{Σ^Σ/Σ}r(Σ)/n+r(Σ)/n\mathbb{E} \{||\hat{\Sigma}-\Sigma||/||\Sigma||\}\asymp \sqrt{r(\Sigma)/n}+r(\Sigma)/n , using the so-called effective rank r(Σ)tr(Σ)/Σr(\Sigma)\equiv \mathrm{tr}(\Sigma)/||\Sigma||. The goal of this paper is to provide a dimension-free first-order characterization for Σ^Σ||\hat{\Sigma}-\Sigma||. We show that \begin{equation*} \bigg|\frac{\mathbb{E} \{||\hat{\Sigma}-\Sigma||/||\Sigma||\} }{\mathbb{E}\sup_{\alpha \in [0,1]}\{(\alpha+n^{-1/2}\mathscr{G}_{\Sigma}(h;\alpha))^2-\alpha^2\}}- 1\bigg| \leq \frac{C}{\sqrt{r(\Sigma)} }, \end{equation*} where {GΣ(h;α):α[0,1]}\{\mathscr{G}_{\Sigma}(h;\alpha): \alpha \in [0,1]\} are (stochastic) Gaussian widths over spherical slices of the (standardized) Σ\Sigma-ellipsoid, playing the role of a first-order analogue to the zeroth-order characteristic r(Σ)r(\Sigma). As an immediate application of the first-order characterization, we obtain a version of the Koltchinskii-Lounici bound with optimal constants. In the more special context of spiked covariance models, our first-order characterization reveals a new phase transition of Σ^Σ||\hat{\Sigma}-\Sigma|| that exhibits qualitatively different behavior compared to the BBP phase transitional behavior of Σ^||\hat{\Sigma}||. A similar phase transition is also proved for the associated eigenvector.

View on arXiv
Comments on this paper