![IconGems-Print](images/IconGems-Print.png)
87 – Invited ePoster Session: a Statistical SmörgÃ¥sbord
Addressing Overfitting in Mixtures of Factor Analyzers
Phillip Shreeves
University of British Columbia
Jeffrey L. Andrews
University of British Columbia Okanagan
The expectation-maximization (EM) algorithm is a common approach for parameter estimation in the context of cluster analysis using finite mixture models. This approach suffers from the well-known issue of convergence to local maxima, but also the less obvious problem of overfitting. Mixtures of factor analyzers assume an underlying factor analysis structure and thereby perform dimensionality reduction simultaneously during the alternating expectation conditional maximization (AECM) model-fitting process. Importantly though, both convergence to local maxima and overfitting remain a concern. We address these concerns by introducing an algorithm that augments the traditional AECM with the nonparametric bootstrap. Further simulations and applications to real data lend support for the usage of this bootstrap augmented AECM-style algorithm.