JSM Preliminary Online Program
This is the preliminary program for the 2006 Joint Statistical Meetings in Seattle, Washington.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2006 Program page




Activity Number: 418
Type: Contributed
Date/Time: Wednesday, August 9, 2006 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Abstract - #306089
Title: A Systematic Benchmark of Dimension Reduction in Remote Homology Detection with Support Vector Machines
Author(s): Melissa M. Matzke*+ and Bobbie-Jo Webb-Robertson and Christopher S. Oehmen and Jorge F. Reyes Spindola
Companies: Pacific Northwest National Laboratory and Pacific Northwest National Laboratory and Pacific Northwest National Laboratory and Pacific Northwest National Laboratory
Address: MS K1-90, Richland, WA, 99352,
Keywords: bioinformatics ; support vector machine (SVM) ; multivariate analysis ; dimensionality reduction ; homology
Abstract:

Biopolymer sequence comparison to identify evolutionarily related proteins is one of the most common and data intensive computing tasks in bioinformatics. One of the most accurate approaches implements support vector machines (SVMs) to classify proteins into families via vectorization of the protein by sequence similarity scores obtained from the Bayesian Algorithm for Local Sequence Alignment (BALSA). However, one primary computational issue with SVMs is the size of the variable set. In this study, the performance of the SVM built with the complete BALSA score set is assessed against a reduced dimensionality. Principal components analysis, sequential projection pursuit, independent component analysis and kernel principal components analysis are used for dimension reduction. The area under the ROC curve is used to compare model performance.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2006 program

JSM 2006 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised April, 2006