Abstract:
|
Since October 2003, researchers applying for NIH funding of $500,000 or more are expected to include a data sharing plan. Researchers releasing public or open data need ensure that the data have enough information to reproduce the main research findings while protecting private information from the study participants. Removal of basic identifiers is not enough to ensure study participants' confidentiality. As a result, more rigorous procedures are often needed to ensure privacy protection for all study participants. In this paper, we present different standard privacy preserving techniques (also known as disclosure techniques) for releasing data for public access that have been developed elsewhere. We discuss with detail the methods that are relevant to clinical trials, and illustrate each method with simulated clinical trial data. This paper evaluates each technique, and to what extent the technique limits utility and protects confidentiality. Finally, open data are recorded factual material accepted in the scientific community as necessary to document and validate research findings. We also discuss the components required for an NIH data release.
|