JSM 2005 - Toronto

Abstract #304730

This is the preliminary program for the 2005 Joint Statistical Meetings in Minneapolis, Minnesota. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2005); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.



The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


The Program has labeled the meeting rooms with "letters" preceding the name of the room, designating in which facility the room is located:

Minneapolis Convention Center = “MCC” Hilton Minneapolis Hotel = “H” Hyatt Regency Minneapolis = “HY”

Back to main JSM 2005 Program page



Legend: = Applied Session, = Theme Session, = Presenter
Activity Number: 226
Type: Contributed
Date/Time: Tuesday, August 9, 2005 : 8:30 AM to 10:20 AM
Sponsor: Biometrics Section
Abstract - #304730
Title: Genome Sequence Analysis Using Mixture Trees
Author(s): Shu-Chuan Chen*+ and Bruce G. Lindsay
Companies: Arizona State University and The Pennsylvania State University
Address: Department of Mathematics and Statistics, Tempe, AZ, 85287, United States
Keywords: Ancestral Mixture Models ; Single nucleotide polymorphisms
Abstract:

Clustering methods have been investigated broadly in the last decade. Since the rapid progress of human genome sequencing, more efficient clustering methods are demanded. In this paper, a new method using ancestral mixture model for clustering binary sequences is proposed. I first show how an ancestral mixture model can be used to build up a hierarchical tree from binary sequence data using an example of genetic single nucleotide polymorphisms (SNP) data. Properties of the ancestral mixture model, such as its nested structure and the relationship to the coalescent process of population genetics, are presented. A model selection method based on an easy-to-calculate quadratic-distance is then proposed. This distance arises by first applying kernel smoothing to both the data and the fitted model to get densities e* and a* on the sequence space. Then, one uses the L2 distance between these to assess the fit of the data to the model. An example of SNP data will be presented to demonstrate how our method works.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2005 program

JSM 2005 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2005