JSM 2019 Proceedings

Technical Support

Phone: (410) 638-9239

Fax: (410) 638-6108

GoToMeeting: Meet Now!

Web: www.CadmiumCD.com

Submit Support Ticket

‹‹ Go Back

Tom Krenzke

Westat

‹‹ Go Back

Jianzhu Li

Westat

‹‹ Go Back

Lin Li

Westat

‹‹ Go Back

Natalie Shlomo

University of Manchester

‹‹ Go Back

←Back

295 – SPEED: Big Data, Small Area Estimation, and Methodological Innovations Under Development, Part 1

Re-Examining File-Level Re-Identification Risk Assessment for Survey Microdata

Sponsor: Survey Research Methods Section

Keywords: re-identification risk, disclosure, log-linear models, goodness of fit, sensitivity analysis

Tom Krenzke

Westat

Jianzhu Li

Westat

Lin Li

Westat

Natalie Shlomo

University of Manchester

In this paper we discuss some practical issues encountered when estimating file-level disclosure risk measures of re-identification in survey microdata. We typically use the log-linear modeling approach (Skinner and Shlomo (2008)) to estimate disclosure risk in survey microdata files. Several challenges emerge that relate to satisfying goodness of fit criteria of the log-linear models in the presence of model assumption violations, and handling large numbers of variables. In the former, we explore several approaches to improve the fit of log-linear models particularly for the case of complex survey designs and differential survey weights. For the latter, we provide guidance for variable selection with insights on how to proceed with the risk assessment and provide meaningful results. We used the National Science Foundation‘s Survey of Doctorate Recipients data as a case study. The results of evaluating the disclosure risk under several approaches lead to guidance for a sensitivity analysis that helps to provide for a better estimate of file-level risk of re-identification in survey microdata.

View paper

"eventScribe", the eventScribe logo, "CadmiumCD", and the CadmiumCD logo are trademarks of CadmiumCD LLC, and may not be copied, imitated or used, in whole or in part, without prior written permission from CadmiumCD. The appearance of these proceedings, customized graphics that are unique to these proceedings, and customized scripts are the service mark, trademark and/or trade dress of CadmiumCD and may not be copied, imitated or used, in whole or in part, without prior written notification. All other trademarks, slogans, company names or logos are the property of their respective owners. Reference to any products, services, processes or other information, by trade name, trademark, manufacturer, owner, or otherwise does not constitute or imply endorsement, sponsorship, or recommendation thereof by CadmiumCD.

As a user you may provide CadmiumCD with feedback. Any ideas or suggestions you provide through any feedback mechanisms on these proceedings may be used by CadmiumCD, at our sole discretion, including future modifications to the eventScribe product. You hereby grant to CadmiumCD and our assigns a perpetual, worldwide, fully transferable, sublicensable, irrevocable, royalty free license to use, reproduce, modify, create derivative works from, distribute, and display the feedback in any manner and for any purpose.

SUBMIT FEEDBACK

Technical Support

Submit Support Ticket

Tom Krenzke

Jianzhu Li

Lin Li

Natalie Shlomo

Please enter your access key

Email This Presentation:

Re-Examining File-Level Re-Identification Risk Assessment for Survey Microdata