We consider median regression and, more generally, a possibly infinite collection of quantile regressions in high-dimensional sparse models. In these models the overall number of regressors is very large, possibly larger than the sample size , but only of these regressors have non-zero impact on the conditional quantile of the response variable, where grows slower than . We consider quantile regression penalized by the -norm of coefficients (-QR). First, we show that -QR is consistent at the rate . The overall number of regressors affects the rate only through the factor, thus allowing nearly exponential growth in the number of zero-impact regressors. The rate result holds under relatively weak conditions, requiring that converges to zero at a super-logarithmic speed and that regularization parameter satisfies certain theoretical constraints. Second, we propose a pivotal, data-driven choice of the regularization parameter and show that it satisfies these theoretical constraints. Third, we show that -QR correctly selects the true minimal model as a valid submodel, when the non-zero coefficients of the true model are well separated from zero. We also show that the number of non-zero coefficients in -QR is of same stochastic order as . Fourth, we analyze the rate of convergence of a two-step estimator that applies ordinary quantile regression to the selected model. Fifth, we evaluate the performance of -QR in a Monte-Carlo experiment, and illustrate its use on an international economic growth application.
View on arXiv