Abstract:
|
Statistical disclosure limitation techniques, such as data swapping, have been used as disclosure protection strategies. In this study we create homogenous clusters based on Gower's distance matrix and complete linkage method and then swap data within the clusters. We compare our method with the random data swapping method and the rank data swapping method in terms of data utility and protection. We conduct simulation studies and apply our method to the 2013 National Health Interview Survey (NHIS) public-use data set, available from the National Center for Health Statistics.
|