Name: 2022 Joint Statistical Meetings
Start: 2022-08-06T07:00:00+00:00
End: 2022-08-11
Location: Walter E. Washington Convention Center

All Times EDT

Abstract Details

Activity Number:	559 - Spectral Analysis, Process Monitoring, and Sampling
Type:	Contributed
Date/Time:	Thursday, August 11, 2022 : 10:30 AM to 12:20 PM
Sponsor:	Section on Physical and Engineering Sciences
Abstract #322634
Title:	Population Obfuscation: A Masking Problem and Some Solutions
Author(s):	Michael Frey* and Adam Wunderlich and Kyle Caudle and Randy Hoover and Lucas Koepke and David Newton
Companies:	National Institute of Standards and Technology and National Institute of Standards and Technology and South Dakota School of Mines and Technology and South Dakota School of Mines and Technology and National Institute of Standards and Technology and National Institute of Standards and Technology
Keywords:	generative adversarial network; differential privacy; obfuscation; optimal transport map; masking; redaction
Abstract:	Sample obfuscation is the widely studied, challenging problem of providing access to a data sample while guarding aspects of its privacy. Sample obfuscation can take different forms, including masking or redaction to protect sample variables, and anonymization or differential privacy to secure individuals' data records. This work extends the notion of sample obfuscation to the obfuscation of populations. Population obfuscation aims to protect information and features of a whole statistical population of data, where the population is represented by an algorithm, formula, model, or sampling plan from which users can synthesize or otherwise access unlimited numbers of data records. Canonical sample masking can be extended to allow masking of general functions of sample variables. With this extension, we present a conceptual framework for population masking, with elementary examples of both canonical and general population masking. Three procedures are outlined for masking a population: one based on transfer learning, one on data augmentation, and one on optimal transport. We also introduce the idea of inherent population masking and offer a simple class of time series examples in which it occurs.

Authors who are presenting talks have a * after their name.

American Statistical Association