Abstract Details

Activity Number: 632 - Advances in Statistical Disclosure Control Methodology
Type: Invited
Date/Time: Thursday, August 1, 2019 : 10:30 AM to 12:20 PM
Sponsor: SSC
Abstract #300426
Title: Balancing Inferential Integrity and Disclosure Risk via Model Targeted Masking and Multiple Imputation
Author(s): Bei Jiang* and Adrian Raftery and Russell Steele and Naisyin Wang
Companies: University of Alberta and University of Washington and McGill University and University of Michigan
Keywords: statistical disclosure control; data augmentation; latent variable; match risk; mixture modeling
Abstract:

In the context of survey sampling, Rubin (1993) proposed releasing multiply imputed synthetic datasets in which the targeted sensitive values are replaced by values drawn from their posterior predictive distributions under proper imputation models. However, information loss due to misspecified imputation models can weaken or even invalidate inferences obtained from the synthetic datasets. In this talk, we discuss a new masking framework based on data augmentation that has the potential to remedy this issue. Moreover, the new framework guarantees valid inferences from the synthetic datasets, and it allows data users to obtain their desired level of data utility while satisfying the disclosure requirements set by agencies. The framework can also be extended and combined with existing methods to accommodate different levels of disclosure protection and further optimize the utility-risk profile. We demonstrate through simulations and an illustrative example that the proposed framework outperforms the classical multiple imputation (MI) approach, preserving more data utility while providing similar or even better protection against disclosure.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program