Online Program Home
My Program

Abstract Details

Activity Number: 649 - Recent Advances in Spatial and Spatial-Temporal Methods
Type: Topic Contributed
Date/Time: Thursday, August 1, 2019 : 10:30 AM to 12:20 PM
Sponsor: Section on Bayesian Statistical Science
Abstract #304176 Presentation
Title: Using Spatiotemporal Models to Generate Synthetic Data for Public Use
Author(s): Harrison Quick* and Lance Waller
Companies: Drexel University and Emory University

When agencies release public-use data, they must be cognizant of the potential risk of disclosure associated with making their data publicly available. This issue is particularly pertinent in disease mapping, where small counts pose both inferential challenges and potential disclosure risks. While the small area estimation, disease mapping, and statistical disclosure limitation literatures are individually robust, there have been few intersections between them. Here, we formally propose the use of spatiotemporal data analysis methods to generate synthetic data for public use. Specifically, we analyze ten years of county-level heart disease death counts for multiple age-groups using a Bayesian model that accounts for dependence spatially, temporally, and between age-groups; generating synthetic data from the resulting posterior predictive distribution will preserve these dependencies. After demonstrating the synthetic data's privacy-preserving features, we illustrate their utility by comparing estimates of urban/rural disparities from the synthetic data to those from data with small counts suppressed.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program