Name: 2018 Joint Statistical Meetings
Start: 2018-07-28T07:00:00+00:00
End: 2018-08-02
Location: Vancouver Convention Centre

Abstract Details

Activity Number:	535 - Contributed Poster Presentations: Section on Statistics in Genomics and Genetics
Type:	Contributed
Date/Time:	Wednesday, August 1, 2018 : 10:30 AM to 12:20 PM
Sponsor:	Section on Statistics in Genomics and Genetics
Abstract #327152
Title:	SAME-Clustering: Single-Cell Aggregated Clustering via Mixture Model Ensemble
Author(s):	Ruth Huh* and Yuchen Yang and Houston Culpepper and Jin Szatkiewicz and Yun Li
Companies:	University of North Carolina at Chapel Hill and University of North Carolina at Chapel Hill and University of North Carolina at Chapel Hill and University of North Carolina at Chapel Hill and University of North Carolina at Chapel Hill
Keywords:	scRNA-seq; clustering; Multinomial Mixture Model; ensemble clustering
Abstract:	Clustering single-cell RNA-seq (scRNA-seq) data is a critically important task. Clustering results themselves are of great importance for shedding light on tissue complexity including the number of cell types present and transcriptomic signatures of each cell type. Due to its importance, several novel methods have been developed recently for clustering scRNA-seq data. However, different approaches generate varying estimates regarding number of clusters and cluster assignments. It is usually hard to gauge which method to use because none of the clustering methods always outperforms others across various datasets. Our SAME-clustering takes multiple sets of clustering results and adopts a probabilistic model to build a consensus, which provides robust and improved clustering results. Specifically, SAME-clustering uses a finite mixture model of multinomial distributions. We have tested SAME-clustering across 15 datasets, with number of clusters varying from 3 to 14, and number of single cells from 49 to 32,695. Results show that our SAME-clustering ensemble method, using a mixture model, yields enhanced clustering, in terms of both cluster assignments and number of clusters.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program

JSM 2018 Online Program

Abstract Details

American Statistical Association