Name: 2019 Joint Statistical Meetings
Start: 2019-07-27T07:00:00+00:00
End: 2019-08-01
Location: Colorado Convention Center

Abstract Details

Activity Number:	653 - Machine Learning and Other Statistical Methods in Clinical Trials
Type:	Contributed
Date/Time:	Thursday, August 1, 2019 : 10:30 AM to 12:20 PM
Sponsor:	Biopharmaceutical Section
Abstract #306845	Presentation
Title:	Machine Learning Methods Evaluation for Small-Size Overlapping Data with Class Imbalance Issue
Author(s):	Guolin Zhao* and Shuo Li and Feng Gao
Companies:	Biogen Inc. and Boston University, and Biogen Inc.
Keywords:	overlapping; class-imbalance; biomarker; machine-learning
Abstract:	For the biomarkers developed for neurodegenerative disorders, interested treatment groups or disease subgroups may be heavily overlapped distributions on these biomarkers. Therefore, differentiating patients become very challenging. Moreover, due to the nature of the therapeutic areas of interest, class imbalance and small sample size are also commonly observed. This imposes additional challenges in differentiating patients using these biomarkers. Through a simulation study, classification performance is evaluated among selected machine learning methods on small sample-size overlapping data with class imbalance. Methods include combinations of multiple resampling approaches and classification algorithms along with two workflows applying different cross-validation (CV) strategies. The simulation results suggested that the random under sampling (RUS) is preferred and needed. Comparing to accuracy as the performance metric, geometric mean balances the accuracy of prediction between both classes and it is a more robust performance measurement. Taking both time consuming and prediction performance into consideration, the one-step repeated CV strategy is preferred.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program

JSM 2019 Online Program

Abstract Details

American Statistical Association