JSM 2013

Technical Support

Phone: (410) 638-9239

Fax: (410) 638-6108

GoToMeeting: Meet Now!

Web: www.CadmiumCD.com

←Back

82 – Monte Carlo Methods: Models and Tests

Alternative Variance Estimators for Data Perturbed for Confidentiality Protection

Sponsor: Survey Research Methods Section

Keywords: Data Perturbation, Disclosure Limitation, Small Area Model, Bootstrap, Multiple Perturbation

Jianzhu Li

Westat

Michael D. Larsen

The George Washington University

Tom Krenzke

Westat

Laura Zayatz

U.S. Census Bureau

One method of protecting confidentiality of tabular data is to apply random perturbation on select variables in the underlying microdata. Perturbation variability needs to be appropriately accounted for in variance estimation for estimates derived from a data file altered through random perturbation. In previous work, we had studied methods for estimating variances using a single perturbed data set, and developed a variance estimator that incorporates a variance component associated with data perturbation. In this paper, we further explore three alternative approaches that can be considered in comparison to the initial estimator, with a goal of increasing the stability of the variance estimation, especially when estimates are extreme. The first alternative modifies the initial estimator through use of multiple perturbed data sets. The second alternative is a limited bootstrap approach that can be done by conducting the perturbation of the bootstrap samples multiple times, producing the replicate estimates, and subsequently computing the variance among the replicate estimates. The third alternative adjusts the initial estimator through the idea of small area estimation. Computational aspects of estimators are discussed. A simulation study was conducted to evaluate and compare the performance of the initial and alternative variance estimators using select variables in two test sites from the American Community Survey 2005-2009 sample data. The results are summarized in terms of the coverage rates and margin of errors of the estimators.

View paper

"eventScribe", the eventScribe logo, "CadmiumCD", and the CadmiumCD logo are trademarks of CadmiumCD LLC, and may not be copied, imitated or used, in whole or in part, without prior written permission from CadmiumCD. The appearance of these proceedings, customized graphics that are unique to these proceedings, and customized scripts are the service mark, trademark and/or trade dress of CadmiumCD and may not be copied, imitated or used, in whole or in part, without prior written notification. All other trademarks, slogans, company names or logos are the property of their respective owners. Reference to any products, services, processes or other information, by trade name, trademark, manufacturer, owner, or otherwise does not constitute or imply endorsement, sponsorship, or recommendation thereof by CadmiumCD.

As a user you may provide CadmiumCD with feedback. Any ideas or suggestions you provide through any feedback mechanisms on these proceedings may be used by CadmiumCD, at our sole discretion, including future modifications to the eventScribe product. You hereby grant to CadmiumCD and our assigns a perpetual, worldwide, fully transferable, sublicensable, irrevocable, royalty free license to use, reproduce, modify, create derivative works from, distribute, and display the feedback in any manner and for any purpose.