Online Program Home
My Program

Abstract Details

Activity Number: 182 - SPEED: New Methods in Statistical Genomics and Genetics Part 2
Type: Contributed
Date/Time: Monday, July 29, 2019 : 10:30 AM to 11:15 AM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #307547
Title: Identifying Appropriate Probabilistic Models for Sparse Discrete Omics Data
Author(s): Hani Aldirawi*
Companies: UIC
Keywords: Omics data; KS test; likelihood ratio test; Zero-inflated

Modeling sparse and discrete omics data such as microbiome and transcriptomics is challenging due to exceeded number of zeros. Many probabilistic models have been used, including Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial models. In this paper, we propose a statistical procedure for identifying the most appropriate discrete probabilistic models for zero-inflated or Hurdle models based on the p-value of the discrete Kolmogorov-Smirnov (KS) test. We develop a general procedure for estimating the parameters for a large class of zero-inflated models and Hurdle models. We also develop a general likelihood ratio test based on Neyman-Pearson lemma for choosing the best model when appropriate ones are more than one.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program