Activity Number:
|
591
- Synthetic Data and Data Disclosure
|
Type:
|
Contributed
|
Date/Time:
|
Wednesday, August 1, 2018 : 2:00 PM to 3:50 PM
|
Sponsor:
|
Government Statistics Section
|
Abstract #329812
|
Presentation
|
Title:
|
Preserving Privacy in Person-Level Data for the American Community Survey
|
Author(s):
|
Michael H. Freiman* and Rolando A. RodrÃguez and Jerome P. Reiter and Amy D. Lauger
|
Companies:
|
U.S. Census Bureau and U.S. Census Bureau and Duke University and U.S. Census Bureau
|
Keywords:
|
privacy;
confidentiality;
synthetic data;
survey data
|
Abstract:
|
The Census Bureau is researching model-based synthetic person-level data that maintain many of the properties of the original American Community Survey (ACS) data while protecting individual privacy. Protecting the ACS while maintaining data quality presents particular challenges because of the ACS's sample weighting, the survey's large number of variables and the small geographies for which ACS data are desired. This paper discusses the reasons adapting existing differential privacy methods is difficult and the describes the approaches we are investigating to protect the data, including tree-based methods for categorical or discrete variables and regression for continuous variables.
|
Authors who are presenting talks have a * after their name.