28
13

Adaptive variable selection in nonparametric sparse additive models

Abstract

We consider the problem of recovery of an unknown multivariate signal ff observed in a dd-dimensional Gaussian white noise model of intensity ε\varepsilon. We assume that ff belongs to a class of smooth functions FdL2([0,1]d){\cal F}^d\subset L_2([0,1]^d) and has an additive sparse structure determined by the parameter ss, the number of non-zero univariate components contributing to ff. We are interested in the case when d=dεd=d_\varepsilon \to \infty as ε0\varepsilon \to 0 and the parameter ss stays "small" relative to dd. With these assumptions, the recovery problem in hand becomes that of determining which sparse additive components are non-zero. Attempting to reconstruct most non-zero components of ff, but not all of them, we arrive at the problem of almost full variable selection in high-dimensional regression. For two different choices of Fd{\cal F}^d, we establish conditions under which almost full variable selection is possible, and provide a procedure that gives almost full variable selection. The procedure does the best (in the asymptotically minimax sense) in selecting most non-zero components of ff. Moreover, it is adaptive in the parameter ss.

View on arXiv
Comments on this paper