Abstract #300063

This is the preliminary program for the 2003 Joint Statistical Meetings in San Francisco, California. It currently includes the technical program (the schedule of invited, topic-contributed, regular contributed, and poster sessions), the Continuing Education courses (August 2-5, 2003), and the Committee and Business Meetings. This online program will be updated frequently to reflect the most recent revisions.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.



JSM 2003 Abstract #300063
Activity Number: 475
Type: Contributed
Date/Time: Thursday, August 7, 2003, 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Title: Applications of the Split-Merge Markov Chain Monte Carlo Technique in the Analysis of Gene Expression Data
Author(s): Sonia Jain*+ and R. M. Neal
Companies: University of California, San Diego and University of Toronto
Address: 9500 Gilman Dr., MC 0622, La Jolla, CA, 92093-0622
Keywords: Dirichlet process; mixture model; Metropolis-Hastings; Gibbs sampling; microarray data
Abstract:

Assigning high-dimensional data to mixture components is a difficult inferential problem when the components lie close together or overlap. We introduce a new split-merge Markov chain Monte Carlo technique that classifies observations efficiently by splitting and merging the components of a nonconjugate Bayesian mixture model. Our method, a Metropolis-Hastings procedure with split-merge proposals, updates the assignments of whole clusters of observations at once rather than reassigning observations to mixture components one at a time. Split-merge proposals are constructed by exploiting properties of a restricted Gibbs sampling scan. We apply the technique to a cancer classification problem in which patients are clustered by leukemia type using gene expression data from DNA microarray experiments.
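
The restricted Gibbs sampling scan mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a simplified conjugate setting (one-dimensional Gaussian data with known variance sigma2 and a normal prior with variance tau2 on each component mean), whereas the abstract addresses nonconjugate models. The function names, the parameter values, and the way the two anchor observations i and j are supplied are all hypothetical choices made for illustration.

```python
import math
import random


def log_predictive(x, members, sigma2=1.0, tau2=100.0):
    """Log predictive density of x given the other members of a component.

    Assumption for this sketch: x ~ N(mu, sigma2) with a conjugate
    prior mu ~ N(0, tau2), so the predictive density is Gaussian.
    """
    precision = len(members) / sigma2 + 1.0 / tau2
    post_mean = (sum(members) / sigma2) / precision
    var = 1.0 / precision + sigma2
    return -0.5 * (math.log(2 * math.pi * var) + (x - post_mean) ** 2 / var)


def restricted_gibbs_scan(data, labels, i, j, rng):
    """One scan restricted to the components of anchor points i and j.

    Every non-anchor point currently in either component is reassigned
    to the component of i or the component of j, and to no other.
    Returns the new labels and the log probability of the sampled moves.
    """
    labels = list(labels)  # work on a copy
    ci, cj = labels[i], labels[j]
    assert ci != cj, "anchors must sit in two distinct components"
    movable = [k for k in range(len(data))
               if k not in (i, j) and labels[k] in (ci, cj)]
    log_q = 0.0
    for k in movable:
        log_w = []
        for c in (ci, cj):
            # Members of component c, excluding the point being moved.
            others = [data[m] for m in range(len(data))
                      if m != k and labels[m] == c]
            log_w.append(math.log(len(others))
                         + log_predictive(data[k], others))
        top = max(log_w)  # subtract the maximum for numerical stability
        w = [math.exp(v - top) for v in log_w]
        p_i = w[0] / (w[0] + w[1])
        if rng.random() < p_i:
            labels[k] = ci
            log_q += math.log(p_i)
        else:
            labels[k] = cj
            log_q += math.log(1.0 - p_i)
    return labels, log_q


if __name__ == "__main__":
    # Two well-separated groups, started from a deliberately mixed labeling.
    data = [-10.0, -10.1, -9.9, 10.0, 10.1, 9.9]
    new_labels, log_q = restricted_gibbs_scan(
        data, [0, 1, 0, 1, 0, 1], i=0, j=3, rng=random.Random(0))
    print(new_labels, log_q)
```

In a full split-merge sampler, the log probability returned by the scan would enter the Metropolis-Hastings acceptance ratio for the proposed split or merge. That acceptance step, and the parameter updates a nonconjugate model requires, are omitted from this sketch.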


  • The address given is for the author marked with a + after their name.
  • Authors presenting talks are marked with a * after their name.


JSM 2003: For information, contact meetings@amstat.org or phone (703) 684-1221. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2003