ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.14396
84
0

Fingerprinting Codes Meet Geometry: Improved Lower Bounds for Private Query Release and Adaptive Data Analysis

18 December 2024
Xin Lyu
Kunal Talwar
ArXiv (abs)PDFHTML
Abstract

Fingerprinting codes are a crucial tool for proving lower bounds in differential privacy. They have been used to prove tight lower bounds for several fundamental questions, especially in the ``low accuracy'' regime. Unlike reconstruction/discrepancy approaches however, they are more suited for query sets that arise naturally from the fingerprinting codes construction. In this work, we propose a general framework for proving fingerprinting type lower bounds, that allows us to tailor the technique to the geometry of the query set. Our approach allows us to prove several new results, including the following. First, we show that any (sample- and population-)accurate algorithm for answering QQQ arbitrary adaptive counting queries over a universe X\mathcal{X}X to accuracy α\alphaα needs Ω(log⁡∣X∣⋅log⁡Qα3)\Omega(\frac{\sqrt{\log |\mathcal{X}|}\cdot \log Q}{\alpha^3})Ω(α3log∣X∣​⋅logQ​) samples, matching known upper bounds. This shows that the approaches based on differential privacy are optimal for this question, and improves significantly on the previously known lower bounds of log⁡Qα2\frac{\log Q}{\alpha^2}α2logQ​ and min⁡(Q,log⁡∣X∣)/α2\min(\sqrt{Q}, \sqrt{\log |\mathcal{X}|})/\alpha^2min(Q​,log∣X∣​)/α2. Second, we show that any (ε,δ)(\varepsilon,\delta)(ε,δ)-DP algorithm for answering QQQ counting queries to accuracy α\alphaα needs Ω(log⁡∣X∣log⁡(1/δ)log⁡Qεα2)\Omega(\frac{\sqrt{ \log|\mathcal{X}| \log(1/\delta)} \log Q}{\varepsilon\alpha^2})Ω(εα2log∣X∣log(1/δ)​logQ​) samples, matching known upper bounds up to constants. Our framework allows for proving this bound via a direct correlation analysis and improves the prior bound of [BUV'14] by log⁡(1/δ)\sqrt{\log(1/\delta)}log(1/δ)​. Third, we characterize the sample complexity of answering a set of random 000-111 queries under approximate differential privacy. We give new upper and lower bounds in different regimes. By combining them with known results, we can complete the whole picture.

View on arXiv
Comments on this paper