ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.01040
104
33
v1v2v3 (latest)

Online Debiasing for Adaptively Collected High-dimensional Data

4 November 2019
Y. Deshpande
Adel Javanmard
M. Mehrabi
    AI4TS
ArXiv (abs)PDFHTML
Abstract

Adaptive collection of data is commonplace in applications throughout science and engineering. From the point of view of statistical inference however, adaptive data collection induces memory and correlation in the sample, and poses significant challenge. We consider the high-dimensional linear regression, where the sample is collected adaptively, and the sample size nnn can be smaller than ppp, the number of covariates. In this setting, there are two distinct sources of bias: the first due to regularization imposed for consistent estimation, e.g. using the LASSO, and the second due to adaptivity in collecting the sample. We propose \emph{`online debiasing'}, a general procedure for estimators such as the LASSO, which addresses both sources of bias. In two concrete contexts (i)(i)(i) batched data collection and (ii)(ii)(ii) time series analysis, we demonstrate that online debiasing optimally debiases the LASSO estimate when the underlying parameter θ0\theta_0θ0​ has sparsity of order o(n/log⁡p)o(\sqrt{n}/\log p)o(n​/logp). In this regime, the debiased estimator can be used to compute ppp-values and confidence intervals of optimal size.

View on arXiv
Comments on this paper