13
1

Sparse PCA Beyond Covariance Thresholding

Abstract

In the Wishart model for sparse PCA we are given nn samples Y1,,YnY_1,\ldots, Y_n drawn independently from a dd-dimensional Gaussian distribution N(0,Id+βvv)N({0, Id + \beta vv^\top}), where β>0\beta > 0 and vRdv\in \mathbb{R}^d is a kk-sparse unit vector, and we wish to recover vv (up to sign). We show that if nΩ(d)n \ge \Omega(d), then for every tkt \ll k there exists an algorithm running in time ndO(t)n\cdot d^{O(t)} that solves this problem as long as \[ \beta \gtrsim \frac{k}{\sqrt{nt}}\sqrt{\ln({2 + td/k^2})}\,. \] Prior to this work, the best polynomial time algorithm in the regime kdk\approx \sqrt{d}, called \emph{Covariance Thresholding} (proposed in [KNV15a] and analyzed in [DM14]), required βknln(2+d/k2)\beta \gtrsim \frac{k}{\sqrt{n}}\sqrt{\ln({2 + d/k^2})}. For large enough constant tt our algorithm runs in polynomial time and has better guarantees than Covariance Thresholding. Previously known algorithms with such guarantees required quasi-polynomial time dO(logd)d^{O(\log d)}. In addition, we show that our techniques work with sparse PCA with adversarial perturbations studied in [dKNS20]. This model generalizes not only sparse PCA, but also other problems studied in prior works, including the sparse planted vector problem. As a consequence, we provide polynomial time algorithms for the sparse planted vector problem that have better guarantees than the state of the art in some regimes. Our approach also works with the Wigner model for sparse PCA. Moreover, we show that it is possible to combine our techniques with recent results on sparse PCA with symmetric heavy-tailed noise [dNNS22]. In particular, in the regime kdk \approx \sqrt{d} we get the first polynomial time algorithm that works with symmetric heavy-tailed noise, while the algorithm from [dNNS22]. requires quasi-polynomial time in these settings.

View on arXiv
Comments on this paper