590

Estimator selection in the Gaussian setting

Abstract

We consider the problem of estimating the mean ff of a Gaussian vector YY the components of which are independent with a common variance that we assume to be unknown. Our estimation procedure is based on estimator selection. More precisely, we start with a collection \FF\FF of estimators of ff based on YY and, with the same data YY, we aim at selecting an estimator among \FF\FF with the smallest Euclidean risk. We allow the cardinality of \FF\FF to be very large (possibly infinite) and also the dependency of the estimators with respect to the data to be possibly unknown. We establish a non-asymptotic risk bound for the selected estimator. When \FF\FF consists of linear estimators, we derive from this bound an oracle-type inequality. For illustration, we carry out two simulation studies. One aims at comparing our procedure to cross-validation for choosing a tuning parameter. The other shows how to implement our approach to solve the problem of variable selection in practice.

View on arXiv
Comments on this paper