JSM 2017 Online Program

Activity Number:	156 - Modern Statistical Methods for Biological Discovery
Type:	Topic Contributed
Date/Time:	Monday, July 31, 2017 : 10:30 AM to 12:20 PM
Sponsor:	International Indian Statistical Association
Abstract #323131	View Presentation
Title:	Sample Size Methods for Developing Predictors from Genomic Data
Author(s):	Kevin Dobbin*
Companies:	University of Georgia
Keywords:	sample size ; machine learning ; classification ; high dimensional data
Abstract:	A common goal of high dimensional genomic data analyses is the development of a class predictor that can be used to assign samples to predefined classes. The class labels may be derived from a binary endpoint or right-censored survival data. Typically in cancer applications, other prognostic markers are available for the samples as well. While fairly standard methods of analyzing such datasets have been developed, sample size methods for such studies are less well established. We present sample size methods we have developed for these settings and discuss various challenges, such as inclusion of clinical covariates into the estimation procedure and computational issues due to the large size and complexity of the calculations.

Authors who are presenting talks have a * after their name.