Abstract Details

Activity Number: 182
Type: Contributed
Date/Time: Monday, August 1, 2016 : 10:30 AM to 12:20 PM
Sponsor: IMS
Abstract #319730
Title: Large-Scale Cluster Analysis Using Fusion Penalties
Author(s): Trambak Banerjee*, Peter Radchenko, and Gourab Mukherjee
Companies: University of Southern California and University of Southern California and University of Southern California
Keywords: Fusion penalty ; Convex clustering ; Asymptotic optimality ; High-dimensional ; Single-cell biology

We propose a novel methodology for variable screening when clustering large-scale datasets that have very large sample sizes and are also high-dimensional. Using a convex clustering criterion based on a fusion penalty, we develop a very fast screening procedure that efficiently discards non-informative variables from the data. We establish asymptotic optimality properties of the proposed method. Through extensive simulation experiments, we compare its performance with that of other clustering algorithms and obtain encouraging results. We demonstrate the applicability of the method to cluster analysis of big datasets arising in single-cell proteomic and genomic studies.
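
For readers unfamiliar with the term, a fusion-penalty convex clustering criterion is commonly written in the general form below. This is the standard formulation from the convex clustering literature, given only as orientation; the abstract does not state the exact criterion, weights, or norm the authors use, so every symbol here is an assumption.

```latex
% Generic convex clustering objective (illustrative; not taken from the abstract).
% x_i : i-th observation in R^p            u_i : fitted centroid attached to x_i
% \lambda \ge 0 : tuning parameter         w_{ij} \ge 0 : pairwise weights
% q : norm used in the fusion penalty (commonly 1, 2, or \infty)
\min_{u_1,\dots,u_n \in \mathbb{R}^p}
  \frac{1}{2}\sum_{i=1}^{n} \lVert x_i - u_i \rVert_2^2
  + \lambda \sum_{1 \le i < j \le n} w_{ij}\, \lVert u_i - u_j \rVert_q
```

In this form, observations whose fitted centroids coincide are assigned to the same cluster: at \lambda = 0 each observation is its own centroid, and for sufficiently large \lambda (with positive weights) all centroids fuse into one.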

Authors who are presenting talks have a * after their name.

Back to the full JSM 2016 program
