Online Program

Return to main conference page

All Times EDT

Friday, October 8
Knowledge
Influence
Fri, Oct 8, 2:45 PM - 4:00 PM
Virtual
This Is Statistics

An Analysis of Estimators in Simulation Optimization-Based Feature Selection (309878)

Sara Shashaani, North Carolina State University 
*Kimia Vahdat, North Carolina State University 

Keywords: Feature Selection, Simulation Optimization, Robust Statistics

Finding the most informative and contributing variables for a prediction rule in big datasets is a challenging task. There are two main obstacles to this problem. First, the decision space is binary with high dimensions, and it is impossible to do an exhaustive search. Second, prediction rules often suffer from the overfitting problem which can misguide the search. Most of the current variable selection methods follow greedy algorithms that end up with sub-optimal solutions. We propose a simulation optimization-based feature selection algorithm with a focus on improving the candidate solution estimation in the simulation process as we separate the estimation and optimization steps. Our proposed estimators employ sampling schemes and influence functions that ensure low bias and variance. We show improvement in several simulated and real datasets in comparison with the state-of-the-art algorithms.