This is the program for the 2010 Joint Statistical Meetings in Vancouver, British Columbia.

Abstract Details

Activity Number: 507
Type: Topic Contributed
Date/Time: Wednesday, August 4, 2010 : 10:30 AM to 12:20 PM
Sponsor: SSC
Abstract - #308495
Title: Simulating Data to Study Performance of Finite Mixture Modeling and Clustering Algorithms
Author(s): Volodymyr Melnykov*+ and Ranjan Maitra
Companies: North Dakota State University and Iowa State University
Address: , Fargo, ND, 58102,
Keywords: cluster overlap ; eccentricity of ellipsoid ; Mclust ; mixture distribution ; MixSim ; parallel distribution plot
Abstract:

We propose a new method to generate sample Gaussian mixture distributions according to pre-specified overlap characteristics. Such methodology is useful in the context of evaluating performance of clustering algorithms. Our suggested approach involves derivation of and calculation of the exact overlap between every cluster pair, measured in terms of their total probability of misclassification, and then guided simulation of Gaussian components satisfying pre-specified overlap characteristics. The algorithm is illustrated in two and five dimensions using contour plots and parallel distribution plots, respectively, which we introduce and develop to display mixture distributions in higher dimensions. The utility of the algorithm is demonstrated via a study of initialization strategies in Gaussian clustering.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2010 program




2010 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.