Online Program Home
My Program

Abstract Details

Activity Number: 274 - Random Forests in Big Data, Machine Learning and Statistics
Type: Invited
Date/Time: Tuesday, July 31, 2018 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #326983 Presentation
Title: Standard Errors and Confidence Intervals for Variable Importance in Random Forest Regression, Classification, and Survival
Author(s): Hemant Ishwaran*
Companies: University of Miami

Random forests is a popular nonparametric tree ensemble well known for highly accurate prediction. But another important feature is that it provides a fully nonparametric measure of variable importance (VIMP) for ranking variables. However, inference for VIMP is difficult due to its highly complex nature. Therefore, we describe a subsampling approach that can be used to estimate the variance of VIMP and to construct confidence intervals. The method is applicable to a wide variety of problems, including regression, classification, and survival, and is found to be highly effective, even surpassing bootstrapping, and most importantly it is computationally fast and attractive for big data settings.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program