Abstract #302060

This is the preliminary program for the 2003 Joint Statistical Meetings in San Francisco, California. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 2-5, 2003); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2003 Program page



JSM 2003 Abstract #302060
Activity Number: 95
Type: Contributed
Date/Time: Monday, August 4, 2003 : 8:30 AM to 10:20 AM
Sponsor: Biometrics Section
Abstract - #302060
Title: Selecting the Number of Components in a Mixture Model
Author(s): Surajit Ray*+
Companies: Pennsylvania State University
Address: 0326 Thomas Building, University Park, PA, 16802-2111,
Keywords: mixture of normals ; quadratic distance ; multivariate bimodality ; kernel-based distance ; gene expression ; risk analysis
Abstract:

Multivariate mixture models provide well known and widely used methods for density estimation, model-based clustering, and explanations for the data generation process. However, the problem of choosing the number of components of a mixture model in a statistically meaningful way is still a subject of considerable research. I introduce several rules for selecting a finite mixture model, and hence estimating the number of components, using quadratic distance functions. In one approach, the goal is to find the minimal number of components that are needed to adequately describe the true distribution, where the decision is based on a nonparametric confidence set for the true distribution. Two alternative approaches for estimating the number of components are based on density concordance and risk analysis. Applications of my methods to gene expression data will be presented during the talk.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2003 program

JSM 2003 For information, contact meetings@amstat.org or phone (703) 684-1221. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2003