Proportionally Fair Clustering

Abstract

We extend the fair machine learning literature by considering the problem of proportional centroid clustering in a metric context. For clustering n points with k centers, we define fairness as proportionality to mean that any n/k points are entitled to form their own cluster if there is another center that is closer in distance for all n/k points. We seek clustering solutions to which there are no such justified complaints from any subsets of agents, without assuming any a priori notion of protected subsets. We present and analyze algorithms to efficiently compute, optimize, and audit proportional solutions. We conclude with an empirical examination of the tradeoff between proportional solutions and the k-means objective.
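To make the proportionality condition concrete, the following is a minimal brute-force audit sketch, not the algorithm from the paper. It rests on several assumptions that the abstract does not state: the coalition size threshold is taken as ceil(n/k), "closer" is read as strictly closer than each point's nearest chosen center, the candidate centers default to the data points themselves, and distances are Euclidean. The function name `find_blocking_center` is hypothetical.

```python
import math


def find_blocking_center(points, centers, candidates=None):
    """Return (candidate, coalition_indices) for the first justified complaint
    found, or None if no candidate attracts enough points.

    Assumptions (not from the source): threshold = ceil(n / k), strict
    improvement, Euclidean distance, candidates default to the data points.
    """
    n, k = len(points), len(centers)
    threshold = math.ceil(n / k)
    if candidates is None:
        candidates = points

    # Distance from each point to its nearest chosen center.
    nearest = [min(math.dist(p, c) for c in centers) for p in points]

    for y in candidates:
        # Points that would be strictly better off with y as their center.
        coalition = [
            i for i, p in enumerate(points) if math.dist(p, y) < nearest[i]
        ]
        if len(coalition) >= threshold:
            return y, coalition
    return None


# Two groups of three points on a line; n = 6, k = 2, so the threshold is 3.
points = [(0,), (1,), (2,), (10,), (11,), (12,)]

# Both centers serve the left group: the right group has a justified complaint.
print(find_blocking_center(points, [(0,), (2,)]))   # ((10,), [3, 4, 5])

# One center per group: no candidate attracts three points.
print(find_blocking_center(points, [(1,), (11,)]))  # None
```

This check costs one pass over the points per candidate, so it is quadratic in n when the candidates are the data points. It only illustrates what a "justified complaint" looks like under the stated assumptions.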
