JSM Preliminary Online Program
This is the preliminary program for the 2007 Joint Statistical Meetings in Salt Lake City, Utah.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.



Back to main JSM 2007 Program page




Activity Number: 340
Type: Contributed
Date/Time: Tuesday, July 31, 2007 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Computing
Abstract - #309724
Title: Clustering Gene Expressions in the Presence of Scatter
Author(s): Ivan Ramler*+ and Ranjan Maitra
Companies: Iowa State University and Iowa State University
Address: , Ames, IA, 50011-1219,
Keywords: Gene Expression ; k-mean-directions ; Bayes Information Criterion
Abstract:

A new methodology is proposed for clustering gene expression datasets in the presence of scattered observations. These are defined to be observations that are unlike any other, so traditional approaches that force them into groups can lead to all-around erroneous conclusions. Our suggested approach is an iterative scheme which proceeds by building the core for each cluster around the centers, identifies points outside as scatter and updates the method until convergence. In the absence of scatter, the algorithm reduces to a k-means algorithm designed for constrained directional data. We also provide methodology to initialize the algorithm as well as to estimate the number of clusters in the dataset. Results on several sets of test experiments show excellent performance. The methodology is applied to gene expression data on the diurnal starch cycle of Arabidposis L. Heynth.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2007 program

JSM 2007 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised September, 2007