Sharp Variable Selection of a Sparse Submatrix in a High-Dimensional Noisy Matrix

We observe a matrix of independent, identically distributed Gaussian random variables which are centered except for elements of some submatrix of size where the mean is larger than some . The submatrix is sparse in the sense that and tend to 0, whereas and tend to infinity. We consider the problem of selecting the random variables with significantly large mean values. We give sufficient conditions on as a function of and and construct a uniformly consistent procedure in order to do sharp variable selection. We also prove the minimax lower bounds under necessary conditions which are complementary to the previous conditions. The critical values separating the necessary and sufficient conditions are sharp (we show exact constants). We note a gap between the critical values for selection of variables and that of detecting that such a submatrix exists given by Butucea and Ingster (2012). When is in this gap, consistent detection is possible but no consistent selector of the corresponding variables can be found.
View on arXiv