Nonasymptotic one-and two-sample tests in high dimension with unknown covariance structure

Let be an i.i.d. sample of square-integrable variables in , \GB{with common expectation and covariance matrix , both unknown.} We consider the problem of testing if is -close to zero, i.e. against ; we also tackle the more general two-sample mean closeness (also known as {\em relevant difference}) testing problem. The aim of this paper is to obtain nonasymptotic upper and lower bounds on the minimal separation distance such that we can control both the Type I and Type II errors at a given level. The main technical tools are concentration inequalities, first for a suitable estimator of used a test statistic, and secondly for estimating the operator and Frobenius norms of coming into the quantiles of said test statistic. These properties are obtained for Gaussian and bounded distributions. A particular attention is given to the dependence in the pseudo-dimension of the distribution, defined as . In particular, for , the minimum separation distance is , in contrast with the minimax estimation distance for , which is (where ). This generalizes a phenomenon spelled out in particular by Baraud (2002).
View on arXiv