Online Program Home
My Program

Abstract Details

Activity Number: 295 - SPEED: Big Data, Small Area Estimation, and Methodological Innovations Under Development, Part 1
Type: Contributed
Date/Time: Tuesday, July 30, 2019 : 8:30 AM to 10:20 AM
Sponsor: Survey Research Methods Section
Abstract #307363
Title: Identity Disclosure Control in Microdata Release by Post-Randomization
Author(s): Xiaoyu Zhai* and Tapan Nayak
Companies: and George Washington University
Keywords: identity disclosure control; PRAM; data utility; Public-use Micro-data

Motivated by RR methods in sampling, Gouweleeuw et al. (1998) introduced Post Randomization (PRAM) Method to protect categorical data from disclosure. PRAM can be applied independently to multiple categorical variables or jointly to the cross-classification variable. Perturbation probabilities for the cross-classification are needed for proper inferences and measuring confidentiality. We adopt the identity disclosure concept in previous literatures, which is considered the most serious confidentiality disclosure, and propose a new structure to control identity disclosure risk for a prefixed upper bound. We derived and explored properties of this structure and examined the trade-off between confidentiality disclosure and data utility. At last, we described the methodology in full details and applied it to a Public-use Micro-data Sample. In the empirical study, we showed that we achieved the disclosure control goals and maintained a sound data utility.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program