On the Query Complexity of Training Data Reconstruction in Private Learning

We analyze the number of queries that a whitebox adversary needs to make to a private learner in order to reconstruct its training data. For DP learners with training data drawn from any arbitrary compact metric space, we provide the \emph{first known lower bounds on the adversary's query complexity} as a function of the learner's privacy parameters. \emph{Our results are minimax optimal for every , covering both -DP and DP as corollaries}. Beyond this, we obtain query complexity lower bounds for R\ényi DP learners that are valid for any . Finally, we analyze data reconstruction attacks on locally compact metric spaces via the framework of Metric DP, a generalization of DP that accounts for the underlying metric structure of the data. In this setting, we provide the first known analysis of data reconstruction in unbounded, high dimensional spaces and obtain query complexity lower bounds that are nearly tight modulo logarithmic factors.
View on arXiv