Online Program Home
My Program

Abstract Details

Activity Number: 434 - SPEED: Classification and Data Science
Type: Contributed
Date/Time: Tuesday, July 31, 2018 : 2:00 PM to 2:45 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #332734
Title: Comparison of Missing Data Methods in the Use of LASSO Regression for Model Selection with Applications to the National Trauma Data Bank
Author(s): Sarah B Peskoe and Tracy Truong* and Lily R Mundy and Ronnie L Shammas and Scott T Hollenbeck
Companies: Duke University and Duke University and Duke University School of Medicine and Duke University School of Medicine and Duke University School of Medicine
Keywords: National Database; IPW; Multiple Imputation; LASSO; Missing Data; Model Selection

National databases provide rich sources to identify relevant predictors for diseases and clinical outcomes. These databases, however, often lack strict guidelines on reporting of patient demographics and outcomes, leading to missing data. In this paper, we compare several methods to control for bias due to missing data in their ability to identify predictors using a lasso algorithm for model selection. Traditionally speaking, multiple imputation is problematic in using shrinkage estimators due to the potential for model variation across imputations. Recent advances have been made to correct for this, namely stacked and grouped multiple imputation. Here, we compare these newer methods to complete case analysis and a proposed inverse probability weighting approach. Through extensive simulation, we demonstrate that multiple imputation can be overly sensitive when the outcome of interest is missing, rather than potential exposures, but may be influenced by the method used for tuning parameter selection. This is evident both with continuous and binary exposure data. We apply these methods to data from the National Trauma Data Bank to identify predictors of open tibia fracture outcomes.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program