Name: 2019 Joint Statistical Meetings
Start: 2019-07-27T07:00:00+00:00
End: 2019-08-01
Location: Colorado Convention Center

Activity Number:	126 - SPEED: New Methods in Statistical Genomics and Genetics Part 1
Type:	Contributed
Date/Time:	Monday, July 29, 2019 : 8:30 AM to 10:20 AM
Sponsor:	Section on Statistics in Genomics and Genetics
Abstract #306422	Presentation
Title:	Identifying Appropriate Probabilistic Models for Sparse Discrete Omics Data
Author(s):	Hani Aldirawi*
Companies:	UIC
Keywords:	Omics data; KS test; likelihood ratio test; Zero-inflated
Abstract:	Modeling sparse and discrete omics data such as microbiome and transcriptomics is challenging due to exceeded number of zeros. Many probabilistic models have been used, including Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial models. In this paper, we propose a statistical procedure for identifying the most appropriate discrete probabilistic models for zero-inflated or Hurdle models based on the p-value of the discrete Kolmogorov-Smirnov (KS) test. We develop a general procedure for estimating the parameters for a large class of zero-inflated models and Hurdle models. We also develop a general likelihood ratio test based on Neyman-Pearson lemma for choosing the best model when appropriate ones are more than one.

Authors who are presenting talks have a * after their name.

JSM 2019 Online Program