107
35

Sharp Variable Selection of a Sparse Submatrix in a High-Dimensional Noisy Matrix

Abstract

We observe a N×MN\times M matrix of independent, identically distributed Gaussian random variables which are centered except for elements of some submatrix of size n×mn\times m where the mean is larger than some a>0a>0. The submatrix is sparse in the sense that n/Nn/N and m/Mm/M tend to 0, whereas n,m,Nn,\, m, \, N and MM tend to infinity. We consider the problem of selecting the random variables with significantly large mean values. We give sufficient conditions on aa as a function of n,m,Nn,\, m,\,N and MM and construct a uniformly consistent procedure in order to do sharp variable selection. We also prove the minimax lower bounds under necessary conditions which are complementary to the previous conditions. The critical values aa^* separating the necessary and sufficient conditions are sharp (we show exact constants). We note a gap between the critical values aa^* for selection of variables and that of detecting that such a submatrix exists given by Butucea and Ingster (2012). When aa^* is in this gap, consistent detection is possible but no consistent selector of the corresponding variables can be found.

View on arXiv
Comments on this paper