Estimator selection in the Gaussian setting
We consider the problem of estimating the mean of a Gaussian vector the components of which are independent with a common variance that we assume to be unknown. Our estimation procedure is based on estimator selection. More precisely, we start with a collection of estimators of based on and, with the same data , we aim at selecting an estimator among with the smallest Euclidean risk. We allow the cardinality of to be very large (possibly infinite) and also the dependency of the estimators with respect to the data to be possibly unknown. We establish a non-asymptotic risk bound for the selected estimator. When consists of linear estimators, we derive from this bound an oracle-type inequality. For illustration, we carry out two simulation studies. One aims at comparing our procedure to cross-validation for choosing a tuning parameter. The other shows how to implement our approach to solve the problem of variable selection in practice.
View on arXiv