55
7

On the Consistency of kk-means++ algorithm

Abstract

We prove in this paper that the expected value of the objective function of the kk-means++ algorithm for samples converges to population expected value. As kk-means++, for samples, provides with constant factor approximation for kk-means objectives, such an approximation can be achieved for the population with increase of the sample size. This result is of potential practical relevance when one is considering using subsampling when clustering large data sets (large data bases).

View on arXiv
Comments on this paper

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.