Abstract Details

Activity Number: 419 - Bayesian Computation and Spatial Modeling
Type: Contributed
Date/Time: Tuesday, July 31, 2018 : 2:00 PM to 3:50 PM
Sponsor: Section on Bayesian Statistical Science
Abstract #330418 Presentation
Title: Bayesian Dimension and Variable Selection for Model-Based Clustering
Author(s): Love Tanzy* and Kyra Singh
Companies: University of Rochester Medical Center and Google, Inc.
Keywords: RJMCMC; wine; model selection

When identifying subpopulations within data, true group structure can be masked by extraneous variables, which motivates a variable selection procedure that identifies the variables important for model-based clustering. In the current clustering literature, empirical Bayes methods tackle model-based clustering and variable selection simultaneously. These approaches have limitations, chiefly the assumption that a single locally optimal solution exists. We propose a fully Bayesian approach in which a set of globally optimal solutions is found using the reversible-jump Markov chain Monte Carlo (RJMCMC) algorithm. Our method permits modeling of the full likelihood, in which the cluster membership proportions, means, and covariance parameters of each component are estimated. We also incorporate variance constraint selection, using the covariance constraints from Banfield and Raftery [1993]. Our method allows dimension changes in the variables, variance constraints, and group subspaces, resulting in a complete representation of the clustering model with simultaneous variable selection.
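
As a rough illustration of the kind of likelihood such a sampler would evaluate, the sketch below computes a Gaussian mixture log-likelihood in which only a chosen subset of variables carries cluster structure. It is not the authors' implementation. The function and argument names are hypothetical, and two simplifying assumptions are made that the abstract does not state: covariances are diagonal (only one of the simplest constraint choices), and non-selected variables follow a single Gaussian shared by all observations.

```python
import math


def log_normal(x, mean, var):
    """Log density of a univariate normal with the given mean and variance."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) for a non-empty list."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def mixture_log_likelihood(data, selected, weights, means, variances,
                           bg_mean, bg_var):
    """Log-likelihood under a variable-selection mixture (illustrative only).

    data      : list of observations, each a list of p floats
    selected  : list of p booleans; True marks a clustering variable
    weights   : list of G mixing proportions summing to 1
    means     : means[k][j], component-specific mean of variable j
    variances : variances[k][j], component-specific variance (diagonal
                covariance is an assumption of this sketch)
    bg_mean, bg_var : shared parameters for the non-selected variables
    """
    total = 0.0
    for row in data:
        # Selected variables contribute through the mixture over components.
        component_terms = []
        for k, w in enumerate(weights):
            term = math.log(w)
            for j, x in enumerate(row):
                if selected[j]:
                    term += log_normal(x, means[k][j], variances[k][j])
            component_terms.append(term)
        total += log_sum_exp(component_terms)
        # Non-selected variables contribute one shared Gaussian term each.
        for j, x in enumerate(row):
            if not selected[j]:
                total += log_normal(x, bg_mean[j], bg_var[j])
    return total
```

In a reversible-jump scheme, a proposed move might flip one entry of `selected` or change the number of components, and the change in this log-likelihood would enter the acceptance ratio together with prior and proposal terms that this sketch omits.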

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program