JSM Preliminary Online Program
This is the preliminary program for the 2009 Joint Statistical Meetings in Washington, DC.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2009 Program page




Activity Number: 71
Type: Contributed
Date/Time: Sunday, August 2, 2009 : 4:00 PM to 5:50 PM
Sponsor: Section on Survey Research Methods
Abstract - #304074
Title: General Discrete-Data Modeling Methods for Producing Synthetic Data with Reduced Reidentification Risk That Preserve Analytic Properties
Author(s): William E. Winkler*+
Companies: U.S. Census Bureau
Address: 4600 Silver Hill Road, Suitland, MD, 20746,
Keywords: privacy ; confidentiality ; synthetic data ; analytic validity
Abstract:

This paper describes a modeling framework to produce synthetic microdata that better corresponds to external benchmark constraints on certain aggregates (such as margins) and on which certain cell probabilities are bounded both below and above to reduce re-identification risk. Rather than use linear constraints (Meng and Rubin 1993), the modeling methods use convex constraints (Winkler 1990, 1993) in an extended MCECM procedure. The methods preserve analytic properties and are presented as a computationally tractable alternative to epsilon-privacy (Dwork 2008). Epsilon-privacy and its extensions have not yet been shown to preserve analytic properties (e.g., Dwork, McSherry, and Talwar 2007, section 5). A lone exception is Machanavajjhala et al. (2008) that preserves analytic properties in a narrowly focused "on-the-map" application.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2009 program


JSM 2009 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised September, 2008