ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4419
23
85

Automatic Subspace Learning via Principal Coefficients Embedding

17 November 2014
Xi Peng
Jiwen Lu
Zhang Yi
Rui Yan
ArXivPDFHTML
Abstract

In this paper, we address two challenging problems in unsupervised subspace learning: 1) how to automatically identify the feature dimension of the learned subspace (i.e., automatic subspace learning), and 2) how to learn the underlying subspace in the presence of Gaussian noise (i.e., robust subspace learning). We show that these two problems can be simultaneously solved by proposing a new method (called principal coefficients embedding, PCE). For a given data set D∈\mathdsRm×n\mathbf{D}\in \mathds{R}^{m\times n}D∈\mathdsRm×n, PCE recovers a clean data set D0∈\mathdsRm×n\mathbf{D}_{0}\in \mathds{R}^{m\times n}D0​∈\mathdsRm×n from D\mathbf{D}D and simultaneously learns a global reconstruction relation C∈Rn×n\mathbf{C}\in \mathbf{R}^{n\times n}C∈Rn×n of D0\mathbf{D}_{0}D0​. By preserving C\mathbf{C}C into an m′m^{\prime}m′-dimensional space, the proposed method obtains a projection matrix that can capture the latent manifold structure of D0\mathbf{D}_{0}D0​, where m′≪mm^{\prime}\ll mm′≪m is automatically determined by the rank of C\mathbf{C}C with theoretical guarantees. PCE has three advantages: 1) it can automatically determine the feature dimension even though data are sampled from a union of multiple linear subspaces in presence of the Gaussian noise, 2) Although the objective function of PCE only considers the Gaussian noise, experimental results show that it is robust to the non-Gaussian noise (\textit{e.g.}, random pixel corruption) and real disguises, 3) Our method has a closed-form solution and can be calculated very fast. Extensive experimental results show the superiority of PCE on a range of databases with respect to the classification accuracy, robustness and efficiency.

View on arXiv
Comments on this paper