This article describes the disclosure avoidance algorithm that the U.S. Census Bureau used to protect the Detailed Demographic and Housing Characteristics File A (Detailed DHC-A) of the 2020 Census. The tabulations contain statistics (counts) of demographic characteristics of the entire population of the United States, crossed with detailed races and ethnicities at varying levels of geography. The article describes the SafeTab-P algorithm, which is based on adding noise drawn to statistics of interest from a discrete Gaussian distribution. A key innovation in SafeTab-P is the ability to adaptively choose how many statistics and at what granularity to release them, depending on the size of a population group. We prove that the algorithm satisfies a well-studied variant of differential privacy, called zero-concentrated differential privacy (zCDP). We then describe how the algorithm was implemented on Tumult Analytics and briefly outline the parameterization and tuning of the algorithm.
View on arXiv@article{haney2025_2505.01472, title={ SafeTab-P: Disclosure Avoidance for the 2020 Census Detailed Demographic and Housing Characteristics File A (Detailed DHC-A) }, author={ Sam Haney and Skye Berghel and Bayard Carlson and Ryan Cumings-Menon and Luke Hartman and Michael Hay and Ashwin Machanavajjhala and Gerome Miklau and Amritha Pai and Simran Rajpal and David Pujol and William Sexton and Ruchit Shrestha and Daniel Simmons-Marengo }, journal={arXiv preprint arXiv:2505.01472}, year={ 2025 } }