Name: 2018 Joint Statistical Meetings
Start: 2018-07-28T07:00:00+00:00
End: 2018-08-02
Location: Vancouver Convention Centre

Activity Number:	358 - Contributed Poster Presentations: Biometrics Section
Type:	Contributed
Date/Time:	Tuesday, July 31, 2018 : 10:30 AM to 12:20 PM
Sponsor:	Biometrics Section
Abstract #330858
Title:	Variable Selection May Be Overrated
Author(s):	Tristan Grogan* and David Elashoff
Companies:	UCLA and UCLA
Keywords:	Regularization; Ridge regression; biomarkers; Variable selection; Stepwise; LASSO
Abstract:	Medical researchers are often interested in selecting a panel of predictor variables for diagnostic or prognostic models. A standard statistical approach is to use logistic regression to identify markers of patient status, such as cancer versus control, with performance assessed by the area under the ROC curve (AUC). This scenario is especially common in biomarker validation studies, which can include large numbers of predictor variables relative to the sample size. Researchers typically try to select the "best" model by using automated variable selection techniques such as forward stepwise, best subsets, or LASSO. We propose that ridge regression has a higher out-of-sample AUC than these more standard methods in most circumstances and should be used more frequently. Our study compares these methods across 20 real biomarker datasets, with sample sizes ranging from 12 to 160 and numbers of markers from 5 to 800.
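
The evaluation idea in the abstract (fit a penalized logistic model on one part of the data, then score held-out subjects by AUC) can be illustrated with a minimal sketch. This is not the authors' code or data: the synthetic dataset, the gradient-descent fitting routine, the penalty value lam=1.0, and all function names are assumptions made for illustration only.

```python
import math
import random


def auc(scores, labels):
    """AUC = P(random case outscores random control); ties count as 1/2."""
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((c > k) + 0.5 * (c == k) for c in cases for k in controls)
    return wins / (len(cases) * len(controls))


def linear_score(w, b, x):
    """Linear predictor; AUC depends only on its ranking, not its scale."""
    return b + sum(wj * xj for wj, xj in zip(w, x))


def fit_ridge_logistic(X, y, lam=1.0, lr=0.1, steps=300):
    """Gradient descent on log-loss plus an L2 penalty (intercept unpenalized)."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(steps):
        grad_w, grad_b = [0.0] * p, 0.0
        for xi, yi in zip(X, y):
            # Clamp the linear predictor so exp() cannot overflow.
            z = max(-30.0, min(30.0, linear_score(w, b, xi)))
            err = 1.0 / (1.0 + math.exp(-z)) - yi
            grad_b += err
            for j in range(p):
                grad_w[j] += err * xi[j]
        w = [wj - lr * (gj + lam * wj) / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


# Synthetic stand-in for a small biomarker study (assumed, NOT the authors' data):
# 60 subjects, 20 markers, only the first 3 markers are shifted in cases.
rng = random.Random(0)
y = [i % 2 for i in range(60)]
X = [[rng.gauss(1.0 * yi if j < 3 else 0.0, 1.0) for j in range(20)] for yi in y]

# Hold out the last 20 subjects so the AUC is out of sample.
X_train, y_train = X[:40], y[:40]
X_test, y_test = X[40:], y[40:]

w, b = fit_ridge_logistic(X_train, y_train, lam=1.0)
test_auc = auc([linear_score(w, b, x) for x in X_test], y_test)
print(f"out-of-sample AUC: {test_auc:.3f}")
```

Ridge keeps all 20 markers and shrinks their coefficients rather than dropping any. A comparison in the spirit of the abstract would repeat the same held-out scoring for a model chosen by stepwise selection, best subsets, or LASSO.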

Authors who are presenting talks have a * after their name.
