Abstract:
|
Clustering with finite mixture models (FMMs) can reveal subgroups with similar characteristics. However, skewed data are common in medicine, and different FMM approaches can produce substantially different findings for such data. The Gaussian FMM is a commonly used method that assumes a skewed distribution is a mixture of normal distributions. Researchers might also consider transforming the data prior to fitting a Gaussian FMM in an attempt to better meet the model's assumptions. Alternatively, the recently proposed skew-t FMM allows clusters to follow skewed distributions. The aim of this work is to provide an empirical assessment and comparison of these recent and conventional FMM methods in order to guide researchers in selecting the most appropriate approach for their data. To accomplish this aim, these approaches are reviewed with respect to the way they manage skewed data, empirically evaluated and compared in a simulation study, and demonstrated using sleep diary and inflammatory marker data.
|
ASA Meetings Department
732 North Washington Street, Alexandria, VA 22314
(703) 684-1221 • meetings@amstat.org
Copyright © American Statistical Association.