107
69

A Random Matrix--Theoretic Approach to Handling Singular Covariance Estimates

Abstract

In many practical situations we would like to estimate the covariance matrix of a set of variables from an insufficient amount of data. More specifically, if we have a set of NN independent, identically distributed measurements of an MM dimensional random vector the maximum likelihood estimate is the sample covariance matrix. Here we consider the case where N<MN<M such that this estimate is singular and therefore fundamentally bad. We present a radically new approach to deal with this situation. Let XX be the M×NM\times N data matrix, where the columns are the NN independent realizations of the random vector with covariance matrix Σ\Sigma. Without loss of generality, we can assume that the random variables have zero mean. We would like to estimate Σ\Sigma from XX. Let KK be the classical sample covariance matrix. Fix a parameter 1LN1\leq L\leq N and consider an ensemble of L×ML\times M random unitary matrices, {Φ}\{\Phi\}, having Haar probability measure. Pre and post multiply KK by Φ\Phi, and by the conjugate transpose of Φ\Phi respectively, to produce a non--singular L×LL\times L reduced dimension covariance estimate. A new estimate for Σ\Sigma, denoted by covL(K)\mathrm{cov}_L(K), is obtained by a) projecting the reduced covariance estimate out (to M×MM\times M) through pre and post multiplication by the conjugate transpose of Φ\Phi, and by Φ\Phi respectively, and b) taking the expectation over the unitary ensemble. Another new estimate (this time for Σ1\Sigma^{-1}), invcovL(K)\mathrm{invcov}_L(K), is obtained by a) inverting the reduced covariance estimate, b) projecting the inverse out (to M×MM\times M) through pre and post multiplication by the conjugate transpose of Φ\Phi, and by Φ\Phi respectively, and c) taking the expectation over the unitary ensemble. We have a closed analytical expression for invcovL(K)\mathrm{invcov}_L(K) and covL(K)\mathrm{cov}_L(K) in terms of its eigenvalue decomposition.

View on arXiv
Comments on this paper