Activity Number:
|
181
- Statistical Methods in Gene Expression Data Analysis II
|
Type:
|
Contributed
|
Date/Time:
|
Tuesday, August 4, 2020 : 10:00 AM to 2:00 PM
|
Sponsor:
|
Section on Statistics in Genomics and Genetics
|
Abstract #313680
|
|
Title:
|
Quantile Normalization of Single-Cell RNA-Seq Read Counts Without Unique Molecular Identifiers
|
Author(s):
|
Frederick Townes* and Rafael Irizarry
|
Companies:
|
Princeton University and Harvard T.H. Chan School of Public Health
|
Keywords:
|
single-cell;
RNA-seq;
UMIs;
normalization;
gene expression
|
Abstract:
|
Single-cell RNA-seq (scRNA-seq) profiles gene expression of individual cells. Unique molecular identifiers (UMIs) remove duplicates in read counts resulting from polymerase chain reaction, a major source of noise. For scRNA-seq data lacking UMIs, we propose quasi-UMIs: quantile normalization of read counts to a compound Poisson distribution empirically derived from UMI datasets. When applied to ground-truth datasets having both reads and UMIs, quasi-UMI normalization has higher accuracy than alternatives such as census counts. Using quasi-UMIs enables methods designed specifically for UMI data to be applied to non-UMI scRNA-seq datasets.
|
Authors who are presenting talks have a * after their name.