Online Program

All Times ET

Program is Subject to Change

Wednesday, June 16
Wed, Jun 16, 10:30 AM - 12:00 PM
Producing Synthetic Microdata with 'High' Utility and 'Low' Disclosure Risk

Validating Outcomes from Synthetic Data: Results from a Multi-Year Implementation (307996)

*Lars Vilhuber, Cornell University 

Keywords: synthetic data, validation server, remote access, data dissemination

The Synthetic Data Server (SDS) at Cornell University was set up to provide early access to new synthetic data products from the U.S. Census Bureau. These datasets are made available to interested researchers in a controlled environment before a more general release. Over the past 7 years, 4 synthetic datasets have been made available on the server, and more than 120 users have accessed it during that period. This paper reports on outcomes of the activity: results of validation requests from a user perspective, the functioning of the feedback loop driven by validation and user input, and the role of the SDS as an access gateway to, and educational tool for, other mechanisms of accessing detailed person, household, establishment, and firm statistics.