ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.08435
19
3

Noisy linear inverse problems under convex constraints: Exact risk asymptotics in high dimensions

20 January 2022
Q. Han
ArXivPDFHTML
Abstract

In the standard Gaussian linear measurement model Y=Xμ0+ξ∈RmY=X\mu_0+\xi \in \mathbb{R}^mY=Xμ0​+ξ∈Rm with a fixed noise level σ>0\sigma>0σ>0, we consider the problem of estimating the unknown signal μ0\mu_0μ0​ under a convex constraint μ0∈K\mu_0 \in Kμ0​∈K, where KKK is a closed convex set in Rn\mathbb{R}^nRn. We show that the risk of the natural convex constrained least squares estimator (LSE) μ^(σ)\hat{\mu}(\sigma)μ^​(σ) can be characterized exactly in high dimensional limits, by that of the convex constrained LSE μ^Kseq\hat{\mu}_K^{\mathsf{seq}}μ^​Kseq​ in the corresponding Gaussian sequence model at a different noise level. The characterization holds (uniformly) for risks in the maximal regime that ranges from constant order all the way down to essentially the parametric rate, as long as certain necessary non-degeneracy condition is satisfied for μ^(σ)\hat{\mu}(\sigma)μ^​(σ). The precise risk characterization reveals a fundamental difference between noiseless (or low noise limit) and noisy linear inverse problems in terms of the sample complexity for signal recovery. A concrete example is given by the isotonic regression problem: While exact recovery of a general monotone signal requires m≫n1/3m\gg n^{1/3}m≫n1/3 samples in the noiseless setting, consistent signal recovery in the noisy setting requires as few as m≫log⁡nm\gg \log nm≫logn samples. Such a discrepancy occurs when the low and high noise risk behavior of μ^Kseq\hat{\mu}_K^{\mathsf{seq}}μ^​Kseq​ differ significantly. In statistical languages, this occurs when μ^Kseq\hat{\mu}_K^{\mathsf{seq}}μ^​Kseq​ estimates 000 at a faster `adaptation rate' than the slower `worst-case rate' for general signals. Several other examples, including non-negative least squares and generalized Lasso (in constrained forms), are also worked out to demonstrate the concrete applicability of the theory in problems of different types.

View on arXiv
Comments on this paper