Online Program Home
  My Program

Abstract Details

Activity Number: 237 - Feature Selection and Statistical Learning in Genomics
Type: Contributed
Date/Time: Monday, July 31, 2017 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #324238 View Presentation
Title: Group Variable Selection with Compositional Covariates
Author(s): Anna Plantinga* and Michael C. Wu
Companies: University of Washington and Fred Hutchinson Cancer Research Center
Keywords: Compositional ; Microbiome ; Group lasso ; ADMM
Abstract:

Feature selection methods for microbiome compositional data have recently been proposed as an alternative to taxon level analyses or distance-based methods comparing entire microbial communities. Such models can effectively handle the high dimensionality of the covariates while enforcing the unit sum constraint of compositional data. However, existing compositional feature selection models do not take full advantage of the multi-level structure of microbiome data. For high dimensional regression models with multi-level compositional covariates, we propose an L1/L2 regularized linear log-contrast model that provides group- and taxon-level sparsity. We express the model as a constrained convex optimization problem and propose an alternating direction method of multipliers algorithm, and we demonstrate selection consistency and bounded loss. The selection and estimation accuracy of our method is evaluated using simulation studies; we also demonstrate its efficacy by applying it to a study relating host gene expression to gut microbiome composition.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association