Online Program Home
My Program

Abstract Details

Activity Number: 590
Type: Invited
Date/Time: Wednesday, August 3, 2016 : 2:00 PM to 3:50 PM
Sponsor: Government Statistics Section
Abstract #318096 View Presentation
Title: Random Forests for High-Throughput Omics Data and Survival Endpoints: A Tour of the Horizon
Author(s): Andreas Ziegler* and Marvin N. Wright and Matthias Schmid
Companies: University of Lübeck and University of Lübeck and University of Bonn
Keywords: Random forests ; Survival analysis ; Gene expression ; Single nucleotide polymorphisms ; Microarray
Abstract:

Random Forests (RF) are fast, flexible and represent a robust approach to mining high-dimensional continouous and discrete survival data. They perform well even in big data problems. The tree-building process of random forests implicitly allows for interaction between features and high correlation between features. Approaches are available to measuring variable importance, which is the basis for feature selection. Although RF perform well in many applications, their theoretical properties have been understood only recently. After a non-theoretical introduction into RF, we summarize the theoretical findings. We survey different versions of RF, including random survival forests (RSF) and conditional inference forests (CIF). Split criteria are summarized and their end-cut preference is discussed. Implementations of RF for survival data are compared with respect to options and runtime. We provide a brief overview of different areas of application of RF with survival data and present real data gene expression and genetic studies.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2016 program

 
 
Copyright © American Statistical Association