
Abstract Details

Activity Number: 169 - SPEED: Improving Survey Data Quality with Multiple Data Sources, Administrative Data, and Nonresponse Bias Control
Type: Contributed
Date/Time: Monday, July 29, 2019 : 10:30 AM to 12:20 PM
Sponsor: Survey Research Methods Section
Abstract #304589
Title: Measures for Identifying Highly Associated Categorical Variables in Survey Data
Author(s): Natalia Weil* and Ismael Flores Cervantes
Companies: Westat and Westat
Keywords: complex survey data; auxiliary categorical data; Goodman and Kruskal’s tau; measures of association for categorical variables; nonresponse adjustments; Cramer’s V

In some population surveys that collect data at different stages (e.g., screener and person questionnaires), auxiliary categorical variables are commonly used to develop nonresponse adjustments. A nonresponse adjustment model with fewer variables is easier to interpret; therefore, it is important to identify and remove predictor variables that are highly associated with one another before proceeding with the analysis. Although well-established methods exist for identifying correlated continuous variables, standard methods for identifying highly associated categorical variables are neither well developed nor well suited to survey data. In this paper, we adapt existing measures of association, such as Goodman and Kruskal’s tau and Cramer’s V, to account for complex sample design. We also evaluate the performance of these weighted measures and their impact on weighting adjustments when they are used to select categorical auxiliary variables for nonresponse adjustments, as compared to a manual selection of variables.
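
As a rough illustration of what a weight-aware version of these measures might look like, the sketch below replaces cell counts with sums of survey weights before computing Cramer's V and Goodman and Kruskal's tau. This is an assumption made for illustration only and is not necessarily the estimator developed in the paper; it uses the weights for point estimates and ignores stratification and clustering. The function names (weighted_table, weighted_cramers_v, weighted_gk_tau) are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def weighted_table(x, y, w):
    """Weighted cell proportions p[(a, b)] for two categorical variables."""
    total = float(sum(w))
    cells = defaultdict(float)
    for a, b, wt in zip(x, y, w):
        cells[(a, b)] += wt / total
    return dict(cells)


def margins(p):
    """Row and column marginal proportions of a weighted table."""
    row, col = defaultdict(float), defaultdict(float)
    for (a, b), v in p.items():
        row[a] += v
        col[b] += v
    return dict(row), dict(col)


def weighted_cramers_v(x, y, w):
    """Cramer's V computed from weighted proportions (symmetric in x and y)."""
    p = weighted_table(x, y, w)
    row, col = margins(p)
    k = min(len(row), len(col)) - 1
    if k == 0:
        return 0.0  # one variable is constant, so no association is measurable
    phi2 = sum(
        (p.get((a, b), 0.0) - row[a] * col[b]) ** 2 / (row[a] * col[b])
        for a in row
        for b in col
    )
    return sqrt(phi2 / k)


def weighted_gk_tau(x, y, w):
    """Goodman and Kruskal's tau for predicting y from x (asymmetric)."""
    p = weighted_table(x, y, w)
    row, col = margins(p)
    col_sq = sum(v * v for v in col.values())
    if col_sq == 1.0:
        return 0.0  # y is constant, so there is no variation to explain
    cond = sum(v * v / row[a] for (a, _), v in p.items())
    return (cond - col_sq) / (1.0 - col_sq)


# Toy data (invented for illustration): unweighted, x and y are independent,
# but the weights induce an association between them.
x = ["a", "a", "b", "b"]
y = ["u", "v", "u", "v"]
w = [3.0, 1.0, 1.0, 3.0]
v_stat = weighted_cramers_v(x, y, w)
tau_stat = weighted_gk_tau(x, y, w)
```

Under this sketch, a pair of auxiliary variables with a value near 1 on either measure would be a candidate for dropping one of the two from the nonresponse adjustment model; any cutoff for "highly associated" would be a separate choice that the abstract does not specify.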

Authors who are presenting talks have a * after their name.
