Estimation Efficiency Under Privacy Constraints

We investigate the problem of estimating a random variable under a privacy constraint dictated by another random variable , where estimation efficiency and privacy are assessed in terms of two different loss functions. In the discrete case, we use the Hamming loss function and express the corresponding utility-privacy tradeoff in terms of the privacy-constrained guessing probability , the maximum probability of correctly guessing given an auxiliary random variable , where the maximization is taken over all ensuring that for a given privacy threshold . We prove that is concave and piecewise linear, which allows us to derive its expression in closed form for any when and are binary. In the non-binary case, we derive in the high utility regime (i.e., for sufficiently large values of ) under the assumption that takes values in . We also analyze the privacy-constrained guessing probability for two binary vector scenarios. When and are continuous random variables, we use the squared-error loss function and express the corresponding utility-privacy tradeoff in terms of , which is the smallest normalized minimum mean squared-error (mmse) incurred in estimating from its Gaussian perturbation , such that the mmse of given is within of the variance of for any non-constant real-valued function . We derive tight upper and lower bounds for when is Gaussian. We also obtain a tight lower bound for for general absolutely continuous random variables when is sufficiently small.
View on arXiv