Online Program Home
My Program

Abstract Details

Activity Number: 234 - Novel Statistical Methods for High-Dimensional Microbiome and Metagenomics Data Analysis
Type: Topic Contributed
Date/Time: Monday, July 29, 2019 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistics in Epidemiology
Abstract #306563 Presentation 1 Presentation 2
Title: Robust Regression for Microbiome Data Analysis
Author(s): Aditya Mishra* and Christian Lorenz Mueller
Companies: Flatiron Institute and Flatiron Institute, Simons Foundation
Keywords: Microbiome; Robustness; Regularization

Recent advances in low-cost metagenomic and amplicon sequencing techniques enable routine sampling of environmental and host-associated microbial communities across different habitats. The data produced by these large-scale surveys typically comprise relative abundances (or compositions) of microbial taxa at different taxonomic levels. To investigate the dependency of additional covariate measurements such as metabolites or host phenotypes on the microbial compositions we introduce a general robust regression framework for compositional data. We propose a novel log-contrast regression model with mean shift parameters that allow the identification of sample outliers and maintains sub-compositional coherence with respect to the associated phylogenetic tree. The model is estimated using a sparse penalized regression approach that simultaneously enforces sparsity in the mean shift and covariate parameters. We demonstrate the superiority of our approach using a wide range of synthetic simulation scenarios and infer novel associations between body mass index measurements and human gut microbes on a large public collection of human gut microbiome data.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program