Online Program Home
My Program

Abstract Details

Activity Number: 659 - Recent Advances in Dimension Reduction and Clustering
Type: Contributed
Date/Time: Thursday, August 1, 2019 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #307132
Title: Cluster Analysis via Random Partition Distributions
Author(s): David Dahl* and Brandon Carter
Companies: Brigham Young University and Brigham Young University
Keywords: Hierarchical clustering; Ewens-Pitman attraction distribution; Chinese restaurant process
Abstract:

Cluster analysis is often used for exploratory data analysis in a variety of fields. Hierarchical clustering specifically is a heuristic method that uses as its input the pairwise distance among the items being clustered. Alternatively, we propose a new method for cluster analysis based on random partition distributions, such as the Ewens-Pitman Attraction (EPA) distribution. The EPA distribution is a probability distribution over all possible clusterings of the observations and is based on the same pairwise distance information used in hierarchical clustering. We seek to characterize the differences and similarities between hierarchical clustering and our clustering analysis based on random partition distributions. The advantages of this new method are illustrated through several case studies.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program