JSM 2004 - Toronto

Abstract #301462

This is the preliminary program for the 2004 Joint Statistical Meetings in Toronto, Canada. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2004); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2004 Program page



Activity Number: 384
Type: Contributed
Date/Time: Wednesday, August 11, 2004 : 2:00 PM to 3:50 PM
Sponsor: Biometrics Section
Abstract - #301462
Title: Feature Selection with SVM Reformulation and Expression-array-based Cancer Classification
Author(s): Sunhee K. Ro and Wei-min Liu*+
Companies: Roche Molecular Systems, Inc. and Roche Molecular Systems, Inc.
Address: 1145 Atlantic Ave., Alameda, CA, 94501,
Keywords: microarray ; cancer classification ; feature selection ; support vector machine
Abstract:

Gene expression microarray analysis is being used extensively for cancer classification and support vector machine (SVM) is one of the successful classification methods. While SVM can be applied to extremely high-dimensional data such as microarray, the removal of irrelevant and redundant features may produce better results. Classical wrapper methods such as the SVM RFE can be used for feature selection while avoiding the combinatorial explosion with greedy technique; however, it is slow because the classification performance should be calculated at every iteration to find the optimum number of features. Weston et al. (2003) proposed to reformulate the SVM problem as minimizing the number of nonzero elements in the weight vector in SVM decision function. The algorithm amounts to a modification of SVM with multiplicative rescaling of the training data at every iteration so that the weights of useless features approach 0 rapidly. We have extended this algorithm to classify the multiple subclasses of leukemia, which consists of the all-pairwise classifiers each of which was trained with the union of the genes selected pairwisely with the multiplicative rescaling.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2004 program

JSM 2004 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2004