ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.06694
54
14
v1v2 (latest)

Structures of Spurious Local Minima in kkk-means

16 February 2020
Wei Qian
Yuqian Zhang
Yudong Chen
ArXiv (abs)PDFHTML
Abstract

kkk-means clustering is a fundamental problem in unsupervised learning. The problem concerns finding a partition of the data points into kkk clusters such that the within-cluster variation is minimized. Despite its importance and wide applicability, a theoretical understanding of the kkk-means problem has not been completely satisfactory. Existing algorithms with theoretical performance guarantees often rely on sophisticated (sometimes artificial) algorithmic techniques and restricted assumptions on the data. The main challenge lies in the non-convex nature of the problem; in particular, there exist additional local solutions other than the global optimum. Moreover, the simplest and most popular algorithm for kkk-means, namely Lloyd's algorithm, generally converges to such spurious local solutions both in theory and in practice. In this paper, we approach the kkk-means problem from a new perspective, by investigating the structures of these spurious local solutions under a probabilistic generative model with kkk ground truth clusters. As soon as k=3k=3k=3, spurious local minima provably exist, even for well-separated and balanced clusters. One such local minimum puts two centers at one true cluster, and the third center in the middle of the other two true clusters. For general kkk, one local minimum puts multiple centers at a true cluster, and one center in the middle of multiple true clusters. Perhaps surprisingly, we prove that this is essentially the only type of spurious local minima under a separation condition. Our results pertain to the kkk-means formulation for mixtures of Gaussians or bounded distributions. Our theoretical results corroborate existing empirical observations and provide justification for several improved algorithms for kkk-means clustering.

View on arXiv
Comments on this paper