Online Program

Friday, October 21
Knowledge
Community
Influence
Fri, Oct 21, 8:00 AM - 8:50 AM
Carolina Ballroom
Poster Session 2 and Continental Breakfast
Sponsored by Bank of America

Variable Selection Methods for Identifying Predictor and Predictor Interactions Associated with Repeatedly Measured Binary Outcomes (303418)

*Bethany Lynn Wolf, Medical University of South Carolina 

Predicting patient disease risk over time may require modeling interactions among variables and within subject correlation. Generalized linear mixed models (GLMMs) can model interactions and within subject correlation, but interactions should be specified a priori and sufficient data are needed to model interactions and main effects. Variables can be selected using stepwise selection, but these methods produce unstable estimates. GlmmLasso, a GLMM adaptation of lasso regression, is an alternative to stepwise selection that penalizes the likelihood to yield sparse models. GMMBoost, another variable selection algorithm, yields sparse models through iterative reweighting of residuals. Both glmmLasso and GMMBoost have been shown to produce biased estimates. We propose a 2-stage approach to address the bias in estimates from glmmLasso and GMMBoost. Simulation studies show that glmmLasso and GMMBoost more effectively recover variables and interactions associated with a repeated binary outcome than step-wise selection. We also show the 2-stage approach for glmmLasso and GMMBoost reduces parameter estimate bias. We apply each method to 2 clinical datasets and compare the resulting models.