Optimal Schemes for Discrete Distribution Estimation under Locally Differential Privacy

We consider the minimax estimation problem of a discrete distribution with support size under privacy constraints. A privatization scheme is applied to each raw sample independently, and we need to estimate the distribution of the raw samples from the privatized samples. A positive number measures the privacy level of a privatization scheme. For a given we consider the problem of constructing optimal privatization schemes with -privacy level, i.e., schemes that minimize the expected estimation loss for the worst-case distribution. Two schemes in the literature provide order optimal performance in the high privacy regime where is very close to and in the low privacy regime where respectively. In this paper, we propose a new family of schemes which substantially improve the performance of the existing schemes in the medium privacy regime when More concretely, we prove that when our schemes reduce the expected estimation loss by under metric and by under metric over the existing schemes. We also prove a lower bound for the region which implies that our schemes are order optimal in this regime.
View on arXiv