All Times EDT

Abstract Details

Activity Number: 386 - SPEED: Statistics in Epidemiology Part 1
Type: Contributed
Date/Time: Wednesday, August 10, 2022 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistics in Epidemiology
Abstract #323186
Title: Quantifying Bacterial Strain-Host Associations with ANPAN
Author(s): Andrew Ghazi* and Yan Yan and Eric A. Franzosa and Curtis Huttenhower
Companies: Broad Institute and Harvard T.H. Chan School of Public Health and Harvard T.H. Chan School of Public Health and Harvard T.H. Chan School of Public Health
Keywords: Microbiome; strains; phylogenetics; regularization; R; high-dimensionality
Abstract:

Microbial strain variation can strongly influence the impact of microbes on host health, but methods for quantifying these differences have been lacking. Strain data have several features that make traditional statistical methods difficult to apply, including high dimensionality, person-specific strain carriage, and complex phylogenetic relatedness. We present ANPAN, an R package that consolidates methods for strain statistics. Combining modern hierarchical modeling strategies with novel adaptive filtering methods specifically designed to interrogate microbial strain profiles, ANPAN facilitates the identification of strain-specific genetic elements associated with host health outcomes. Additionally, we use regularized phylogenetic generalized linear mixed models to characterize the effect of strain-level community structure. We validate our methods by simulation and by application to a dataset of 1262 colorectal cancer patients, showing that we achieve more accurate effect size estimation and a lower false positive rate than current methodologies. The open-source ANPAN repository is available at https://github.com/biobakery/anpan.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program