Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 125 - Practical Recommendations for Prediction Modeling That Advance Innovation
Type: Invited
Date/Time: Monday, August 8, 2022 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #319183
Title: Variable Selection with Random Forest: Choosing the Best Method for Different Types of Data
Author(s): Jaime Lynn Speiser*
Companies: Wake Forest School of Medicine
Keywords: random forest; prediction modeling; variable selection
Abstract:

Random forest is a popular method for developing prediction models. The first step in developing prediction models often involves reducing the number of variables to be included. Dozens of random forest variable selection methods exist; however, there is a paucity of literature to guide users as to which method may be preferable for different types of datasets. Using several hundred datasets freely available online, we evaluate the prediction error rates, number of variables selected, and computation times. This presentation will discuss preferable random forest variable selection methods, tailored for different types of innovative real-world applications.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program