Activity Number:
|
234
- Novel Statistical Methods for High-Dimensional Microbiome and Metagenomics Data Analysis
|
Type:
|
Topic Contributed
|
Date/Time:
|
Monday, July 29, 2019 : 2:00 PM to 3:50 PM
|
Sponsor:
|
Section on Statistics in Epidemiology
|
Abstract #306563
|
Presentation 1
Presentation 2
|
Title:
|
Robust Regression for Microbiome Data Analysis
|
Author(s):
|
Aditya Mishra* and Christian Lorenz Mueller
|
Companies:
|
Flatiron Institute and Flatiron Institute, Simons Foundation
|
Keywords:
|
Microbiome;
Robustness;
Regularization
|
Abstract:
|
Recent advances in low-cost metagenomic and amplicon sequencing techniques enable routine sampling of environmental and host-associated microbial communities across different habitats. The data produced by these large-scale surveys typically comprise relative abundances (or compositions) of microbial taxa at different taxonomic levels. To investigate the dependency of additional covariate measurements such as metabolites or host phenotypes on the microbial compositions we introduce a general robust regression framework for compositional data. We propose a novel log-contrast regression model with mean shift parameters that allow the identification of sample outliers and maintains sub-compositional coherence with respect to the associated phylogenetic tree. The model is estimated using a sparse penalized regression approach that simultaneously enforces sparsity in the mean shift and covariate parameters. We demonstrate the superiority of our approach using a wide range of synthetic simulation scenarios and infer novel associations between body mass index measurements and human gut microbes on a large public collection of human gut microbiome data.
|
Authors who are presenting talks have a * after their name.