Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 239 - Synthetic Data and Differential Privacy: Data, Privacy and the Public Good
Type: Invited
Date/Time: Tuesday, August 4, 2020 : 1:00 PM to 2:50 PM
Sponsor: Survey Research Methods Section
Abstract #309532
Title: On the Formally Private Use of Public Historical Data to Control Upward Bias in Small Counts
Author(s): Philip Leclerc*
Companies: US Census Bureau
Keywords:
Abstract:

The 2020 Decennial Census will be the first nation-wide Census in the world to use formally private disclosure limitation methods to protect the privacy of respondents. The principal algorithm developed for this purpose is called the “Topdown Algorithm” (TDA). This algorithm proceeds recursively from the national level down to the Census block level. TDA generates fully saturated contingency tables (called “histograms” in the differential privacy literature) that are informationally equivalent to microdata. The corresponding microdata are used to support the production of tens of billions of tabulations about the demographic characteristics of the U.S. population. Many technical challenges were encountered in the development of TDA. In this talk, I discuss one important challenge: the lack of good theory for controlling upward bias. Upward bias in small counts results when using mathematical optimization to estimate nonnegative integer-valued histograms at massive scale. To combat this problem, we have developed techniques that use public historical data to estimate “small cell structure” without weakening differential privacy’s worst-case privacy guarantees. I will discuss bo


Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program