JSM Preliminary Online Program
This is the preliminary program for the 2009 Joint Statistical Meetings in Washington, DC.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2009 Program page




Activity Number: 562
Type: Contributed
Date/Time: Thursday, August 6, 2009 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #304273
Title: Cancer Microarray Feature Selection Using Support Vector Machines: Comparing Regularization Techniques
Author(s): Tim Peters*+ and David Bulger and Jean Y.H. Yang and To-ha Loi and David Ma
Companies: Macquarie University and Macquarie University and University of Sydney and St. Vincent's Hospital and St. Vincent's Hospital
Address: Balaclava Road, North Ryde, New South Wales, Sydney, International, 2109, Australia
Keywords: Microarray ; Machine learning ; Feature Selection ; Support Vector Machines ; Regularization ; Unbalanced datasets
Abstract:

Microarray data set dimensionality reduction is a prerequisite for avoiding overfitting, and a precursor to developing diagnostic tools. We show that a generic stepwise regression using the linear support vector machine penalized margin width as the objective function, subject to regularization parameter grid-search, gives superior performance to three other feature-selection methods (least-angle regression, Random Forest, and stepwise regression on Fisher discriminants). We use a hierarchical validation method, applying leave-one-out cross-validation within the training subset, and applying the trained classifier to a separate test subset, on each of four two-class cancer-related gene expression data sets. The generic method shows superior results, especially on unbalanced data sets, including Golub (1999). A fixed regularization value appears nearly optimal for all four data sets


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2009 program


JSM 2009 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised September, 2008