ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 0811.3628
89
869

High-dimensional covariance estimation by minimizing ℓ1\ell_1ℓ1​-penalized log-determinant divergence

21 November 2008
Pradeep Ravikumar
Martin J. Wainwright
Garvesh Raskutti
Bin Yu
ArXivPDFHTML
Abstract

Given i.i.d. observations of a random vector X∈RpX \in \mathbb{R}^pX∈Rp, we study the problem of estimating both its covariance matrix Σ∗\Sigma^*Σ∗, and its inverse covariance or concentration matrix {Θ∗=(Σ∗)−1\Theta^* = (\Sigma^*)^{-1}Θ∗=(Σ∗)−1.} We estimate Θ∗\Theta^*Θ∗ by minimizing an ℓ1\ell_1ℓ1​-penalized log-determinant Bregman divergence; in the multivariate Gaussian case, this approach corresponds to ℓ1\ell_1ℓ1​-penalized maximum likelihood, and the structure of Θ∗\Theta^*Θ∗ is specified by the graph of an associated Gaussian Markov random field. We analyze the performance of this estimator under high-dimensional scaling, in which the number of nodes in the graph ppp, the number of edges sss and the maximum node degree ddd, are allowed to grow as a function of the sample size nnn. In addition to the parameters (p,s,d)(p,s,d)(p,s,d), our analysis identifies other key quantities covariance matrix Σ∗\Sigma^*Σ∗; and (b) the ℓ∞\ell_\inftyℓ∞​ operator norm of the sub-matrix ΓSS∗\Gamma^*_{S S}ΓSS∗​, where SSS indexes the graph edges, and Γ∗=(Θ∗)−1⊗(Θ∗)−1\Gamma^* = (\Theta^*)^{-1} \otimes (\Theta^*)^{-1}Γ∗=(Θ∗)−1⊗(Θ∗)−1; and (c) a mutual incoherence or irrepresentability measure on the matrix Γ∗\Gamma^*Γ∗ and (d) the rate of decay 1/f(n,δ)1/f(n,\delta)1/f(n,δ) on the probabilities {∣Σ^ijn−Σij∗∣>δ} \{|\hat{\Sigma}^n_{ij}- \Sigma^*_{ij}| > \delta \}{∣Σ^ijn​−Σij∗​∣>δ}, where Σ^n\hat{\Sigma}^nΣ^n is the sample covariance based on nnn samples. Our first result establishes consistency of our estimate Θ^\hat{\Theta}Θ^ in the elementwise maximum-norm. This in turn allows us to derive convergence rates in Frobenius and spectral norms, with improvements upon existing results for graphs with maximum node degrees d=o(s)d = o(\sqrt{s})d=o(s​). In our second result, we show that with probability converging to one, the estimate Θ^\hat{\Theta}Θ^ correctly specifies the zero pattern of the concentration matrix Θ∗\Theta^*Θ∗.

View on arXiv
Comments on this paper