Online Program

Return to main conference page
Saturday, February 16
Sat, Feb 16, 8:00 AM - 9:15 AM
St. James Ballroom
Poster Session 3 and Continental Breakfast

The Data Detective's Toolkit (303863)

View Presentation View Presentation

*Kim Chantala, RTI International 

Keywords: Data cleaning, data validation, skip patterns, codebooks, data crosswalks

The Data Detective’s Toolkit is a set of SAS macros to help keep data preparation on schedule and within budget. This toolkit provides an efficient and low-cost way to create codebooks, master lists of SAS data sets for a project, reports of variables needing special investigation or cleaning, validation of skip patterns, and data crosswalks showing the relationship of variables across datasets. Traditionally, these documents are produced at the end of the project with a great deal of programming and manual effort, but these tools allow the programmer to seamlessly create these documents and reports at any time during the data preparation task. Producing these documents early in data collection improves data quality as well as communication between the data collection team and client. The only requirement for producing these documents and reports is having SAS data sets with formats and labels. The Data Detective’s Toolkit is provided as open-source SAS code and can be downloaded from Github at no charge.