27
16

Discrete Distribution Estimation under User-level Local Differential Privacy

Abstract

We study discrete distribution estimation under user-level local differential privacy (LDP). In user-level ε\varepsilon-LDP, each user has m1m\ge1 samples and the privacy of all mm samples must be preserved simultaneously. We resolve the following dilemma: While on the one hand having more samples per user should provide more information about the underlying distribution, on the other hand, guaranteeing the privacy of all mm samples should make the estimation task more difficult. We obtain tight bounds for this problem under almost all parameter regimes. Perhaps surprisingly, we show that in suitable parameter regimes, having mm samples per user is equivalent to having mm times more users, each with only one sample. Our results demonstrate interesting phase transitions for mm and the privacy parameter ε\varepsilon in the estimation risk. Finally, connecting with recent results on shuffled DP, we show that combined with random shuffling, our algorithm leads to optimal error guarantees (up to logarithmic factors) under the central model of user-level DP in certain parameter regimes. We provide several simulations to verify our theoretical findings.

View on arXiv
Comments on this paper