Online Program Home
  My Program

Abstract Details

Activity Number: 460 - Clustering Methods for Big Data Problems
Type: Topic Contributed
Date/Time: Wednesday, August 2, 2017 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #323090 View Presentation
Title: A Bayesian Lasso Functional Clustering Model
Author(s): Alejandro Murua* and Folly Adjogou and Wolfgang Raffelsberger
Companies: Universite de Montreal and Université de Montréal and Institut de Génetique et de Biologie Moléculaire et Cellulaire, Université de Strasbourg
Keywords: functional data analysis ; clustering ; classification ; lasso penalty ; gene expression data
Abstract:

We introduce a Bayesian lasso penalty model for time-course data. The goal is to do clustering of functional data. Our model combines functional data analysis with model based clustering. The functional framework is used to model time-course data. Principal functional components are described by score coefficients which embed the curves in a much lower dimensional space. Model based clustering is performed on the score space, thus avoiding the curse of dimensionality in the curves space. The number of clusters as well as the dimension of the score space are determined via a Bayesian lasso penalty model. Monte Carlo techniques are used in order to estimate the normalizing constants for different values of the penalty parameters. One of the advantages of the Bayesian lasso model is that it avoids the need to perform a costly cross-validation to select the penalty parameters. We present an application to the analysis of gene-expression data associated with the effects of exposure to tobacco smoke.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association