JSM 2015 Preliminary Program

Online Program Home
My Program

Abstract Details

Activity Number: 297
Type: Topic Contributed
Date/Time: Tuesday, August 11, 2015 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Mining
Abstract #316988
Title: Bayesian Clustering of Multi-Source Data
Author(s): Eric Lock*
Companies: University of Minnesota
Keywords: Bayesian ; Clustering ; Multi-source ; Data integration ; Dirichlet mixtures
Abstract:

In biomedical research a growing number of platforms and technologies are used to measure diverse but related information. The task of clustering a set of objects based on multiple sources of data arises in several applications. Most current approaches to multi-source clustering either independently determine a clustering for each data source, or determine a single joint clustering based on all data sources. We propose an integrative statistical model that permits a different clustering for each data source. These source-specific clusterings adhere loosely to an overall consensus clustering, and hence they are not independent. We describe a computationally scalable Bayesian framework for simultaneous estimation of both the consensus clustering and the source-specific clusterings. We demonstrate that this flexible approach is more robust than joint clustering of all data sources, and is more powerful than clustering each data source independently. Furthermore, We present an application to subtype identification of breast cancer tumor samples using publicly available data from The Cancer Genome Atlas.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2015 program





For program information, contact the JSM Registration Department or phone (888) 231-3473.

For Professional Development information, contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

2015 JSM Online Program Home