7
0

Bi-Level Unsupervised Feature Selection

Abstract

Unsupervised feature selection (UFS) is an important task in data engineering. However, most UFS methods construct models from a single perspective and often fail to simultaneously evaluate feature importance and preserve their inherent data structure, thus limiting their performance. To address this challenge, we propose a novel bi-level unsupervised feature selection (BLUFS) method, including a clustering level and a feature level. Specifically, at the clustering level, spectral clustering is used to generate pseudo-labels for representing the data structure, while a continuous linear regression model is developed to learn the projection matrix. At the feature level, the 2,0\ell_{2,0}-norm constraint is imposed on the projection matrix for more effectively selecting features. To the best of our knowledge, this is the first work to combine a bi-level framework with the 2,0\ell_{2,0}-norm. To solve the proposed bi-level model, we design an efficient proximal alternating minimization (PAM) algorithm, whose subproblems either have explicit solutions or can be computed by fast solvers. Furthermore, we establish the convergence result and computational complexity. Finally, extensive experiments on two synthetic datasets and eight real datasets demonstrate the superiority of BLUFS in clustering and classification tasks.

View on arXiv
@article{liu2025_2505.20563,
  title={ Bi-Level Unsupervised Feature Selection },
  author={ Jingjing Liu and Xiansen Ju and Xianchao Xiu and Wanquan Liu },
  journal={arXiv preprint arXiv:2505.20563},
  year={ 2025 }
}
Comments on this paper