Abstract:
|
Over the past decade, advances in data science and the availability of vast amounts of public data has increased the risk of identifying individuals from published statistics. For this reason, the U. S. Census Bureau is developing a formal differentially private disclosure avoidance system for protecting 2020 Census data. A consequence of this is that it may not be feasible to publish the more granular Census data, such as counts for some detailed race, ethnicity, and tribal groups at low levels of geography, that has been produced in the past. In this paper, we introduce a spatial change of support model for the differentially private measurements, which utilizes auxiliary publicly available data sources, such as American Community Survey data, and past Census data. We show that accurate, model-based estimates of the number in a detailed race group in a geography which may be misaligned from the source data, such as an American Indian or Alaska Native area, can be made.
|