JSM 2013 Home
Online Program Home
My Program

Abstract Details

Activity Number: 366
Type: Contributed
Date/Time: Tuesday, August 6, 2013 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #308045
Title: Regularization and Estimation in Regression with Cluster Variables
Author(s): Qingzhao Yu*+ and Bin Li
Companies: LSU Health Sciences Center and Louisiana State University
Keywords: Clustered variables ; Lasso ; Principal component analysis
Abstract:

We propose the Cluster Lasso, a new regularization method for linear regressions. The Cluster Lasso can do variable selections while keeping the correlation structures among variables. In addition, Cluster Lasso encourages selection of clusters of variables, in which variables having the same mechanism in predicting the response variable will be selected in the regression model together. Real microarray data example and simulation studies show that Cluster Lasso outperforms lasso in terms of the prediction performance, particularly when there is collinearity among variables and/or when the number of predictors is larger than the number of observations. The Cluster Lasso paths can be obtained using any established algorithms for lasso solution. An algorithm is proposed to detect variable correlation structures and to compute Cluster Lasso paths efficiently.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2013 program




2013 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.