JSM 2013 Home
Online Program Home
My Program

Abstract Details

Activity Number: 191
Type: Contributed
Date/Time: Monday, August 5, 2013 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #310291
Title: On the Sensitivity of the Lasso to the Number of Predictor Variables
Author(s): Cheryl Flynn*+ and Clifford M. Hurvich and Jeffrey S. Simonoff
Companies: New York University and Stern School of Business, New York University and Stern School of Business, New York University
Keywords: Lasso ; Oracle inequalities ; High-dimensional data
Abstract:

The Lasso is a computationally efficient procedure that can produce sparse estimators when the number of predictors (p) is large. Oracle inequalities provide probability loss bounds for the Lasso estimator at a deterministic choice of the regularization parameter. These bounds tend to zero if p is appropriately controlled, and are thus commonly cited as theoretical justification for the Lasso and its ability to handle high-dimensional settings. Unfortunately, in practice the regularization parameter is not selected to be a deterministic quantity, but is instead chosen using a random, data-dependent procedure. To address this shortcoming of previous theoretical work, we study the loss of the Lasso estimator when tuned optimally for prediction. Assuming orthonormal predictors and a sparse true model, we prove that the best possible predictive performance of the Lasso deteriorates as $p$ increases with positive probability. We further demonstrate empirically that the deterioration in performance can be far worse than suggested by the commonly held views in the literature and that this deterioration persists as the sample size increases.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2013 program




2013 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.