In this work, we study diversity-aware clustering problems where the data points are associated with multiple attributes resulting in intersecting groups. A clustering solution needs to ensure that the number of chosen cluster centers from each group should be within the range defined by a lower and upper bound threshold for each group, while simultaneously minimizing the clustering objective, which can be either -median, -means or -supplier. We study the computational complexity of the proposed problems, offering insights into their NP-hardness, polynomial-time inapproximability, and fixed-parameter intractability. We present parameterized approximation algorithms with approximation ratios , , and for diversity-aware -median, diversity-aware -means and diversity-aware -supplier, respectively. Assuming Gap-ETH, the approximation ratios are tight for the diversity-aware -median and diversity-aware -means problems. Our results imply the same approximation factors for their respective fair variants with disjoint groups -- fair -median, fair -means, and fair -supplier -- with lower bound requirements.
View on arXiv@article{thejaswi2025_2401.05502, title={ Diversity-aware clustering: Computational Complexity and Approximation Algorithms }, author={ Suhas Thejaswi and Ameet Gadekar and Bruno Ordozgoiti and Aristides Gionis }, journal={arXiv preprint arXiv:2401.05502}, year={ 2025 } }