ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.08976
50
129

Local Search Yields a PTAS for k-Means in Doubling Metrics

29 March 2016
Zachary Friggstad
M. Rezapour
M. Salavatipour
ArXivPDFHTML
Abstract

The most well known and ubiquitous clustering problem encountered in nearly every branch of science is undoubtedly kkk-means: given a set of data points and a parameter kkk, select kkk centres and partition the data points into kkk clusters around these centres so that the sum of squares of distances of the points to their cluster centre is minimized. Typically these data points lie Rd\mathbb{R}^dRd for some d≥2d\geq 2d≥2. kkk-means and the first algorithms for it were introduced in the 1950's. Since then, hundreds of papers have studied this problem and many algorithms have been proposed for it. The most commonly used algorithm is known as Lloyd-Forgy, which is also referred to as "the" kkk-means algorithm, and various extensions of it often work very well in practice. However, they may produce solutions whose cost is arbitrarily large compared to the optimum solution. Kanungo et al. [2004] analyzed a simple local search heuristic to get a polynomial-time algorithm with approximation ratio 9+ϵ9+\epsilon9+ϵ for any fixed ϵ>0\epsilon>0ϵ>0 for kkk-means in Euclidean space. Finding an algorithm with a better approximation guarantee has remained one of the biggest open questions in this area, in particular whether one can get a true PTAS for fixed dimension Euclidean space. We settle this problem by showing that a simple local search algorithm provides a PTAS for kkk-means in Rd\mathbb{R}^dRd for any fixed ddd. More precisely, for any error parameter ϵ>0\epsilon>0ϵ>0, the local search algorithm that considers swaps of up to ρ=dO(d)⋅ϵ−O(d/ϵ)\rho=d^{O(d)}\cdot{\epsilon}^{-O(d/\epsilon)}ρ=dO(d)⋅ϵ−O(d/ϵ) centres at a time finds a solution using exactly kkk centres whose cost is at most a (1+ϵ)(1+\epsilon)(1+ϵ)-factor greater than the optimum. Finally, we provide the first demonstration that local search yields a PTAS for the uncapacitated facility location problem and kkk-median with non-uniform opening costs in doubling metrics.

View on arXiv
Comments on this paper