Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 288 - SLDS CSpeed 5
Type: Contributed
Date/Time: Wednesday, August 11, 2021 : 1:30 PM to 3:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #319023
Title: Unsupervised Feature Decorrelation for Variable Selection
Author(s): Ana Maria Kenney* and Francesca Chiaromonte
Companies: Pennsylvania State University and Penn State University
Keywords: Decorrelation; Whitening; Feature Selection
Abstract:

Strong correlations among features are well-known hurdles for existing variable selection/screening methods. Previous studies demonstrated that transforming predictors through a pre-processing step called ZCA whitening can greatly improve accuracy in certain selection procedures. However, this whitening method induces complete de-correlation at the cost of similarity with the original set of predictors and thus, interpretability. We propose a more general technique that allows one to leave a small, harmless level of collinearity in order to strengthen the mapping between original and transformed variables through the use of semidefinite programming. We demonstrate the benefits and drawbacks of this method along with other decorrelation procedures when applied prior to selection techniques through an in-depth simulation study and real data application.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program