Online Program

Return to main conference page

All Times ET

Friday, June 4
Machine Learning
Random Forests Shaping Decisions
Fri, Jun 4, 3:20 PM - 4:55 PM
TBD
 

Augmented Bagging and Implications for Variable Importance (309640)

Lucas Mentch, University of Pittsburgh 
*Siyu Zhou, University of Pittsburgh 

Keywords: Variable Importance, Regularization, Random Forests

Here we propose an augmented bagging (AugBagg) procedure, which performs bagging on an augmented feature space containing additional randomly generated noise features. Surprisingly and counterintuitively, this simple inclusion of noise features has implicit regularization effect, leading to improved model performance in low signal-to-noise ratio (SNR) settings. As a result, common notions of variable importance based on improvements in model accuracy can be fatefully flawed.