Non-Parametric Inference Adaptive to Intrinsic Dimension

We consider non-parametric estimation and inference of conditional moment models in high dimensions. We show that even when the dimension of the conditioning variable is larger than the sample size , estimation and inference is feasible as long as the distribution of the conditioning variable has small intrinsic dimension , as measured by locally low doubling measures. Our estimation is based on a sub-sampled ensemble of the -nearest neighbors (-NN) -estimator. We show that if the intrinsic dimension of the covariate distribution is equal to , then the finite sample estimation error of our estimator is of order and our estimate is -asymptotically normal, irrespective of . The sub-sampling size required for achieving these results depends on the unknown intrinsic dimension . We propose an adaptive data-driven approach for choosing this parameter and prove that it achieves the desired rates. We discuss extensions and applications to heterogeneous treatment effect estimation.
View on arXiv