23
6

Differentially-Private Sublinear-Time Clustering

Abstract

Clustering is an essential primitive in unsupervised machine learning. We bring forth the problem of sublinear-time differentially-private clustering as a natural and well-motivated direction of research. We combine the kk-means and kk-median sublinear-time results of Mishra et al. (SODA, 2001) and of Czumaj and Sohler (Rand. Struct. and Algorithms, 2007) with recent results on private clustering of Balcan et al. (ICML 2017), Gupta et al. (SODA, 2010) and Ghazi et al. (NeurIPS, 2020) to obtain sublinear-time private kk-means and kk-median algorithms via subsampling. We also investigate the privacy benefits of subsampling for group privacy.

View on arXiv
Comments on this paper