Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 578 - SBSS Student Paper Competition II
Type: Topic Contributed
Date/Time: Thursday, August 6, 2020 : 3:00 PM to 4:50 PM
Sponsor: Section on Bayesian Statistical Science
Abstract #309775
Title: SBSS Student Paper Competition II: Bayesian Biclustering for Metagenomic Sequencing Data via Multinomial Matrix Factorization
Author(s): Fangting Zhou*
Companies: Texas A&M University
Keywords: mixture model; phylogenetic Indian buffet process; compositional data analysis; Bayesian nonparametric prior
Abstract:

High-throughput sequencing technology provides unprecedented opportunities to quantitatively explore human gut microbiome and its relation to diseases. Microbiome data are compositional, sparse, noisy, and heterogeneous, which pose serious challenges for statistical modeling. We propose a Bayesian multinomial matrix factorization model to infer overlapping clusters on both microbes and human hosts. The proposed method represents the observed over-dispersed zero-inflated count matrix as Dirichlet-multinomial mixtures on which the latent cluster structures are built hierarchically. Under the Bayesian framework, the number of clusters is automatically determined and available information from a taxonomic rank tree of the microbes is incorporated, which greatly improves the interpretability of the findings. We demonstrate the utility of the proposed approach using simulations and an application to a human inflammatory bowel disease microbiome dataset. The application reveals interesting clusters, some of which contain known bacteria that are related to the disease, supported by existing biological literature.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program