JSM 2014 Home
Online Program Home
My Program

Abstract Details

Activity Number: 462
Type: Contributed
Date/Time: Wednesday, August 6, 2014 : 8:30 AM to 10:20 AM
Sponsor: Government Statistics Section
Abstract #313030
Title: Local Synthesis for Disclosure Limitation via Model-Based Clustering
Author(s): Anna Oganyan*+
Companies: NCHS
Keywords: synthetic data ; mixture model ; Expectation-Maximization (EM) ; hybrid SDL method ; latent class regression model
Abstract:

Medical data records often contain sensitive information about the data subjects. Before releasing such data, e.g. for clinical research purposes, data owners have to apply Statistical Disclosure Limitation (SDL) methods to such data. SDL methods often consist of masking or synthesizing the original data records in such a way to minimize the risk of disclosure of the confidential information and at the same time provide legitimate data users with accurate information about the population of interest. In this paper we propose a new scheme for disclosure limitation which is based on the idea of local synthesis of data. We argue that the procedures of dividing the records in homogeneous groups (the ``local" part) and synthesizing the records in the groups should be carefully chosen, so that clustering and synthesis would ``fit" each other in the best possible way. Our approach to this problem is based on model-based clustering. Our experiments with genuine medical data sets show that local synthesis is superior to other methods considered for comparison including synthetic data generated using the sequential regression approach, because it can preserve complex relatio


Authors who are presenting talks have a * after their name.

Back to the full JSM 2014 program




2014 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Professional Development program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.