A Tight VC-Dimension Analysis of Clustering Coresets with Applications

We consider coresets for -clustering problems, where the goal is to assign points to centers minimizing powers of distances. A popular example is the -median objective . Given a point set , a coreset is a small weighted subset that approximates the cost of for all candidate solutions up to a multiplicative factor. In this paper, we give a sharp VC-dimension based analysis for coreset construction. As a consequence, we obtain improved -median coreset bounds for the following metrics:Coresets of size for shortest path metrics in planar graphs, improving over the bounds by [Cohen-Addad, Saulpic, Schwiegelshohn, STOC'21] and by [Braverman, Jiang, Krauthgamer, Wu, SODA'21].Coresets of size for clustering -dimensional polygonal curves of length at most with curves of length at most with respect to Frechet metrics, improving over the bounds by [Braverman, Cohen-Addad, Jiang, Krauthgamer, Schwiegelshohn, Toftrup, and Wu, FOCS'22] and by [Conradi, Kolbe, Psarros, Rohde, SoCG'24].
View on arXiv