Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 156 - Contributed Poster Presentations: Section on Statistics in Genomics and Genetics
Type: Contributed
Date/Time: Monday, August 8, 2022 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #323085
Title: Robust and Accurate Estimation of Cellular Fraction from Tissue Omics Data via Ensemble Deconvolution
Author(s): Manqi Cai* and Molin Yue and Tianmeng Chen and Jinling Liu and Erick Forno and Xinghua Lu and Timothy Billar and Juan Carlos Celedón and Chris McKennan and Wei Chen and Jiebiao Wang
Companies: University of Pittsburgh and University of Pittsburgh and University of Pittsburgh and Missouri University of Science and Technology and University of Pittsburgh and University of Pittsburgh and University of Pittsburgh and University of Pittsburgh and University of Pittsburgh and University of Pittsburgh and University of Pittsburgh
Keywords: Cell type deconvolution; Ensemble learning; Marker gene selection; DNA methylation data
Abstract:

Tissue-level gene expression analysis is known to be confounded by cellular heterogeneity. To adjust for the confounding, dozens of cellular deconvolution methods have been proposed to infer cell-type fractions from tissue omics data. However, these methods produce vastly different results under various settings, and benchmarking showed no universally best deconvolution methods. To achieve a robust estimation of cellular fractions, we proposed EnsDeconv (Ensemble Deconvolution), which uses L1-loss-based ensemble learning to synthesize the results from deconvolution methods, reference datasets, marker gene selection procedures, data normalization, and transformations. Different from simulation-based benchmarking, we compiled four large real datasets with measured cellular fractions and comprehensively evaluated EnsDeconv’s performance in different tissue types. Evaluations demonstrated that EnsDeconv yields more stable, robust, and accurate results than existing methods. In addition, we illustrated that EnsDeconv enables various downstream analyses such as differential fractions associated with clinical variables. We further extended EnsDeconv to analyze bulk DNA methylation data.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program