84
119

Quantile Regression Under Memory Constraint

Abstract

This paper studies the inference problem in quantile regression (QR) for a large sample size nn but under a limited memory constraint, where the memory can only store a small batch of data of size mm. A natural method is the na\"ive divide-and-conquer approach, which splits data into batches of size mm, computes the local QR estimator for each batch, and then aggregates the estimators via averaging. However, this method only works when n=o(m2)n=o(m^2) and is computationally expensive. This paper proposes a computationally efficient method, which only requires an initial QR estimator on a small batch of data and then successively refines the estimator via multiple rounds of aggregations. Theoretically, as long as nn grows polynomially in mm, we establish the asymptotic normality for the obtained estimator and show that our estimator with only a few rounds of aggregations achieves the same efficiency as the QR estimator computed on all the data. Moreover, our result allows the case that the dimensionality pp goes to infinity. The proposed method can also be applied to address the QR problem under distributed computing environment (e.g., in a large-scale sensor network) or for real-time streaming data.

View on arXiv
Comments on this paper

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.