Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 244 - Advances in Statistical Machine Learning
Type: Contributed
Date/Time: Tuesday, August 9, 2022 : 8:30 AM to 10:20 AM
Sponsor: IMS
Abstract #322504
Title: High-Dimensional Random Forests
Author(s): ROLAND FIAGBE*
Companies: UNIVERSITY OF CENTRAL FLORIDA
Keywords: Random Forest; Ridge Regression; High-Dimensional; noninformative
Abstract:

The significant advances in technology have enabled easy collection and management of high-dimensional data in many fields, however, the process of modeling these data imposes a huge problem in the field of data science. Dealing with high-dimensional data is one of the significant challenges that degenerate the performance and precision of most classification and regression algorithms, e.g., random forests. Random Forest (RF) is among the few methods that can be extended to model high-dimensional data; nevertheless, its performance and precision, like others, are highly affected by high dimensions, especially when the dataset contains a huge number of noise or noninformative features. It is known in literature that data dominated with a high number of uninformative features have a small number of expected informative variables that could lead to the challenge of obtaining an accurate or robust random forest model. My study presents a new algorithm that incorporates ridge regression as a variable screening tool to discern informative features in the setting of high dimensions and apply the classical random forest to a top portion of selected important features.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program