Assessing the privacy of randomized vector-valued queries to a database using the area under the receiver-operator characteristic curve
*Ofer Harel, Department of Statistics, University of Connecticut 
Gregory J. Matthews, University of Massachusetts 

Keywords: Privacy, ROC curve, Statistical Disclosure Limitation

As the amount of data generated continues to increase, protecting individuals' privacy is a growing concern. As a result, a large body of research has addressed methods of statistical disclosure control (SDC). Some of these methods release a randomized version of the data rather than the actual data. While methods of this type offer some protection because no actual data are released, private information can still be disclosed. Quantifying the level of privacy provided by these methods is often difficult. A method for assessing privacy using the receiver-operator characteristic (ROC) curve, based on ideas related to differential privacy, has previously been proposed. However, that method was demonstrated only for univariate randomized releases. Here, the ROC-based privacy measure is extended to the release of randomized vectors.