Online Program Home
  My Program

Abstract Details

Activity Number: 649 - Advances in Finite Mixture Modeling and Model-Based Clustering
Type: Topic Contributed
Date/Time: Thursday, August 3, 2017 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Abstract #323525 View Presentation
Title: Merging K-Means Solutions for Clustering
Author(s): Semhar Michael* and Volodymyr Melnykov
Companies: South Dakota State University and The University of Alabama
Keywords: K-means ; finite mixture models ; merging ; BIC ; pairwise overlap
Abstract:

Existing clustering methods range from simple but very restrictive to complex but more flexible. The K-means algorithm is one of the most popular clustering procedures due to its computational speed and intuitive construction. Unfortunately, the application of K-means in its traditional form based on Euclidean distances is limited to cases with spherical clusters of approximately the same size. At the same time, it is a common practice among researchers to use the algorithm without checking underlying assumptions. As a result, obtained solutions are often meaningless or misleading. We propose merging solutions obtained by K-means to produce meaningful groupings. The notion of pairwise overlap is used to measure the closeness of groups in the K-means solution. A novel display called overlap map is proposed to decide on the optimal number of clusters. The ideas are illustrated on challenging examples and applied to the problem of color quantization with good results.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association