Abstract Details

Activity Number: 167
Type: Topic Contributed
Date/Time: Monday, August 1, 2016 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #320411
Title: Model-Based Regression Clustering for High-Dimensional Data
Author(s): Emilie Devijver*
Keywords: model based clustering ; model selection ; Lasso ; high-dimension ; regression

Finite mixture regression models are useful for modeling the relationship between a response and predictors arising from different subpopulations. In this talk, we consider the setting where both the predictors and the response are high-dimensional, and we propose a procedure that clusters observations according to the link between the predictors and the response. To reduce the dimension, we propose to use the Lasso estimator, which exploits sparsity, together with a maximum likelihood estimator, which reduces the bias. To select the number of components and the sparsity level, we construct a collection of models by varying these two parameters, and we select a model from this collection using a non-asymptotic criterion. We apply and evaluate our methods on both simulated and real datasets to understand how they work in practice.
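
The combination of a Lasso step for sparsity and a maximum likelihood step for bias reduction can be illustrated with a minimal sketch. It is a deliberate simplification and not the procedure presented in the talk: it assumes a single mixture component, a univariate response, and Gaussian noise, so that the maximum likelihood estimator on the selected variables is ordinary least squares. The function names (`lasso_cd`, `least_squares`, `lasso_then_refit`), the toy data, and the penalty value `lam=0.5` are all hypothetical choices made for illustration.

```python
def soft_threshold(z, t):
    """Soft-thresholding operator, the proximal map of t * |.|."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def lasso_cd(X, y, lam, n_sweeps=200):
    """Lasso by cyclic coordinate descent (fixed number of sweeps).

    Minimizes (1 / (2n)) * ||y - X b||^2 + lam * ||b||_1.
    """
    n, p = len(X), len(X[0])
    b = [0.0] * p
    resid = list(y)  # residual y - X b, with b = 0 initially
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of column j with the partial residual
            rho = sum(X[i][j] * resid[i] for i in range(n)) / n + col_sq[j] * b[j]
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_bj - b[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= delta * X[i][j]
                b[j] = new_bj
    return b


def least_squares(X, y):
    """Solve the normal equations by Gauss-Jordan elimination.

    Assumes X^T X is invertible (illustration only).
    """
    n, p = len(X), len(X[0])
    # Augmented matrix [X^T X | X^T y]
    A = [[sum(X[i][r] * X[i][c] for i in range(n)) for c in range(p)]
         + [sum(X[i][r] * y[i] for i in range(n))] for r in range(p)]
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * c for a, c in zip(A[r], A[col])]
    return [A[r][p] / A[r][r] for r in range(p)]


def lasso_then_refit(X, y, lam):
    """Select variables with the Lasso, then refit them without penalty."""
    b_lasso = lasso_cd(X, y, lam)
    support = [j for j, bj in enumerate(b_lasso) if bj != 0.0]
    b_refit = [0.0] * len(b_lasso)
    if support:
        X_sub = [[row[j] for j in support] for row in X]
        for j, bj in zip(support, least_squares(X_sub, y)):
            b_refit[j] = bj
    return b_lasso, support, b_refit


# Hypothetical toy data: orthogonal design, noise-free response
# generated from the sparse coefficient vector (2, 0, -1.5).
X = [[1, 1, 1], [1, -1, 1], [1, 1, -1], [1, -1, -1]]
y = [0.5, 0.5, 3.5, 3.5]

b_lasso, support, b_refit = lasso_then_refit(X, y, lam=0.5)
print("Lasso estimate:  ", b_lasso)
print("Selected support:", support)
print("Refitted estimate:", b_refit)
```

On this noise-free orthogonal design, the Lasso sets the second coefficient to zero and shrinks the other two toward zero by the penalty level, while the unpenalized refit on the selected variables recovers the unshrunk values. In a mixture setting, a comparable refit would be carried out within each component, and the whole procedure repeated over a grid of component counts and penalty levels to build the model collection from which one model is selected.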

Authors who are presenting talks have a * after their name.

Back to the full JSM 2016 program

Copyright © American Statistical Association