Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 152 - Statistical Methods for Data Privacy and Statistical Modeling of Social and Economic Factors
Type: Contributed
Date/Time: Monday, August 8, 2022 : 10:30 AM to 12:20 PM
Sponsor: Social Statistics Section
Abstract #322916
Title: A Formal Privacy Framework for Partially Private Data
Author(s): Jeremy Seeman* and Aleksandra B. Slavkovic and Matthew Reimherr
Companies: Penn State University and Penn State University and Penn State University
Keywords: Differential Privacy; Measurement Error; Synthetic Data; Optimal Transport
Abstract:

Despite its many useful theoretical properties, differential privacy (DP) has one substantial blind spot: any release that non-trivially depends on confidential data without additional privacy-preserving randomization fails to satisfy DP. Such a restriction is rarely met in practice, as most data releases under DP are actually "partially privateā€ data (PPD). This poses a significant barrier to accounting for privacy risk and utility under logistical constraints imposed on data curators, especially those working with official statistics. In this paper, we propose a privacy definition which accommodates PPD and prove it maintains similar properties to standard DP. We derive optimal transport-based mechanisms for releasing PPD that satisfy our definition and algorithms for valid statistical inference using PPD, demonstrating their improved performance over post-processing methods. Finally, we apply these methods to a case study on US Census and CDC PPD to investigate private COVID-19 infection rates. In doing so, we show how data curators can use our framework to overcome barriers to operationalizing formal privacy while providing more transparency and accountability to users.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program