Online Program Home
  My Program

Abstract Details

Activity Number: 131 - Predictive Modeling in Data Science
Type: Contributed
Date/Time: Monday, July 31, 2017 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #323081 View Presentation
Title: Selected Model Averaging in High-Dimensional Linear Regression
Author(s): Craig Rolling* and Yongli Zhang
Companies: Saint Louis University and University of Oregon
Keywords: Bootstrap ; Cross-validation ; High-dimensional regression ; Model averaging ; Model selection
Abstract:

While much work has been done in the area of model selection for high-dimensional regression, less attention has been given to model averaging in high dimensions. Because the high-dimensional setting increases the difficulty of model identification, a well-constructed model average often can provide large gains in prediction accuracy over model selection when the number of covariates is large. However, the challenges in high-dimensional regression regarding which, and how many, models to combine make it an under-studied topic. Another important question regarding model combination is whether it really improves prediction when a single good model does exist. To address these challenges, we introduce a procedure called Selected Model Averaging (SMA) that uses resampling to adaptively determine which and how many models to combine. Unlike many other model averaging methods, our method reduces to model selection when appropriate and thus bridges the gap between model selection and combination. Numerical studies demonstrate that our method performs well in a broad variety of high-dimensional settings.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association