Online Program

Return to main conference page

All Times ET

Friday, June 4
Computational Statistics
New Models and Methods
Fri, Jun 4, 1:20 PM - 2:55 PM
TBD
 

Clustering Data with Nonignorable Missingness using Semi-Parametric Mixture Models (309749)

Marie Du Roy de Chaumaray, CREST / ENSAI 
*Matthieu Marbac, CREST / ENSAI 

Keywords: Clustering, Mixture Model, Nonignorable Missigness, Smoothed Likelihood.

We are concerned in clustering continuous data sets subject to nonignorable missingness. We perform clustering with a specific semi-parametric mixture, avoiding the component distributions and the missingness process to be specified, under the assumption of conditional independence given the component. Estimation is performed by maximizing an extension of smoothed likelihood allowing missingness. This optimization is achieved by a Majorization-Minorization algorithm. We illustrate the relevance of our approach by numerical experiments. Under mild assumptions, we show the identifiability of our model, the monotony of the MM algorithm as well as the consistency of the estimator. We propose an extension of the new method to the case of mixed-type data that we illustrate on a real data set.

More details are available in https://arxiv.org/abs/2009.07662