41
5
v1v2v3 (latest)

On the Relation Between Identifiability, Differential Privacy and Mutual-Information Privacy

Abstract

This paper investigates the relation between three different notions of privacy: identifiability, differential privacy and mutual-information privacy. Under a unified privacy-distortion framework, where the distortion is defined to be the Hamming distance of the input and output databases, we establish some fundamental connections between these three privacy notions. Given a distortion level DD, define ϵi(D)\epsilon_{\mathrm{i}}^*(D) to be the smallest (best) identifiability level, and ϵd(D)\epsilon_{\mathrm{d}}^*(D) to be the smallest differential privacy level. We characterize ϵi(D)\epsilon_{\mathrm{i}}^*(D) and ϵd(D)\epsilon_{\mathrm{d}}^*(D), and prove that ϵi(D)ϵXϵd(D)ϵi(D)\epsilon_{\mathrm{i}}^*(D)-\epsilon_X\le\epsilon_{\mathrm{d}}^*(D)\le\epsilon_{\mathrm{i}}^*(D) for DD in some range, where ϵX\epsilon_X is a constant depending on the distribution of the original database XX, and diminishes to zero when the distribution of XX is uniform. Furthermore, we show that identifiability and mutual-information privacy are consistent in the sense that given distortion level DD, the mechanism that optimizes the mutual-information privacy also minimizes the identifiability level.

View on arXiv
Comments on this paper