Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 202 - SLDS Student Paper Awards
Type: Topic-Contributed
Date/Time: Tuesday, August 10, 2021 : 1:30 PM to 3:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #317423
Title: Estimating the Number of Components in Finite Mixture Models via the Group-Sort-Fuse Procedure
Author(s): Tudor Manole* and Abbas Khalili
Companies: Carnegie Mellon University and McGill University
Keywords: Mixture Models; Model Selection; Penalized Likelihood; Wasserstein Distance
Abstract:

Finite mixture models provide a natural framework for analyzing data from heterogeneous populations. In practice, however, the number of mixture components (or order) may be unknown. We propose the Group-Sort-Fuse (GSF) procedure---a new penalized likelihood approach for simultaneous estimation of the order and mixing measure in multidimensional finite mixture models. Unlike methods which fit and compare mixtures with varying orders using criteria involving model complexity, our approach directly penalizes a continuous function of the model parameters. Specifically, given a conservative upper bound on the order, the GSF groups and sorts mixture component parameters in order to fuse those which are redundant. For a wide range of finite mixture models, we show that the GSF is consistent in estimating the true mixture order and achieves the parametric convergence rate for mixing measure estimation up to polylogarithmic factors, under a suitable Wasserstein distance.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program