Abstract Details

Activity Number: 575 - Statistical Methods for Batch Effect Correction and Cell Type Deconvolution
Type: Contributed
Date/Time: Wednesday, July 31, 2019 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #303010 Presentation
Title: Assessing Reproducibility of High-Throughput Experiments in Case of Missing Data
Author(s): Roopali Singh* and Qunhua Li
Companies: and Penn State University
Keywords: Reproducibility; Copula; Correspondence curve; High throughput experiments; Sequencing depth

The output of high-throughput biological experiments often contains many missing observations when signals fall below the detection level. For example, the majority of reported expression levels in single-cell RNA-seq are zeros. Existing methods for reproducibility assessment do not account for missing values, leading to biased results. In this paper, we study how the reproducibility of high-throughput experiments is affected by the choice of operational factors (e.g., platform or sequencing depth) when a large number of measurements are missing. Using a latent variable approach, we extend correspondence curve regression to incorporate missing values. Our approach estimates the independent effects of covariates on reproducibility and on the amount of missing data. Using simulations, we show that our method is more accurate in detecting differences in reproducibility than existing measures of reproducibility. We illustrate the usefulness of our method using a study of HCT116 cells from scRNA-seq libraries prepared with microfluidic and tube-based methods. We also determine the cost-effective sequencing depth required to achieve sufficient reproducibility.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program