Online Program Home
My Program

Abstract Details

Activity Number: 289 - Assessing the Quality of Integrated Data
Type: Topic Contributed
Date/Time: Tuesday, July 30, 2019 : 8:30 AM to 10:20 AM
Sponsor: Government Statistics Section
Abstract #304734 Presentation
Title: Tools for Evaluating Quality of State and Local Administrative Data
Author(s): Zachary H Seeskin* and Gabriel Ugarte and Rupa Datta
Companies: NORC at the University of Chicago and NORC at the University of Chicago and NORC at the University of Chicago
Keywords: Data quality; Administrative data; R; Data visualization; Exploratory data analysis

State and local administrative data sources, including data used for managing benefit programs, are increasingly recognized as powerful resources for evidence-building, either as standalone data sources or through linkage to other sources. Evaluating administrative data quality is critical for agencies to make proper use of their data and to improve the data for future use. However, state and local agencies often lack the resources and training for staff to conduct rigorous evaluations of data quality. We present an R-based toolkit to assist researchers working with these administrative datasets to assess data quality, providing guidance and code for checks on data accuracy, the completeness of the records, and the comparability of the data over time and among subgroups of interest. The data quality assessment methods employed draw from the literature, incorporating descriptive statistics, data visualization, and exploratory data analysis to identify sets of records or variables for which quality may be a concern. Further, we discuss principles for undertaking customized data quality analyses for a specific data source that go beyond these general tools.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program