Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 347 - Contributed Poster Presentations: Section on Statistical Computing
Type: Contributed
Date/Time: Tuesday, August 9, 2022 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Computing
Abstract #323206
Title: Robustifying and Increasing Performance of Models for a Categorical Response Using Improved Variable Selection
Author(s): Myriam Maumy* and Frédéric Bertrand
Companies: Troyes Technology University and Troyes Technology University
Keywords: resampling; variable selection; logistic regression; classification; robust; categorical response
Abstract:

A major challenge in current statistics is variable selection. Even though many authors have proposed methods in the literature for the past years, in a context where the number of variables vastly exceeds the number of observations or in a highly correlated framework, their performances are generally limited in recall and precision.

We improve the performance of existing classification models, for instance, regression-based ones, using correlated resampling techniques. Taking into account this correlation structure is the fundamental strength of our approach, which allows us to select reliable variables in parsimonious or non-parsimonious classification problems. For example, we have succeeded in increasing the performance of glmnet logistic models, variational approximation methods for a binary response, spls discriminant analysis models and sparse generalized pls models. In addition, we can compute a confidence index based on the resamplings that helps assess the stability of each of the variables that the model may select.

We demonstrated the performance increase due to our method by using a comprehensive simulation benchmark based on simulated and real data sets.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program