Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 33 - Junior Research in Methods for Integrating Heterogeneous Data: From Clustering to Factor Analysis
Type: Topic Contributed
Date/Time: Monday, August 3, 2020 : 10:00 AM to 11:50 AM
Sponsor: International Society for Bayesian Analysis (ISBA)
Abstract #312589
Title: Bayesian Combinatorial Multi-Study Factor Analysis with the Indian Buffet Process
Author(s): Isabella Grabski* and Roberta De Vito and Lorenzo Trippa and Giovanni Parmigiani
Companies: Harvard University and Brown University and Harvard School of Public Health and Harvard T.H. Chan School of Public Health
Keywords: Factor analysis; Multi-study; Bayesian nonparametrics ; Heterogeneous data
Abstract:

Using multiple studies in a single statistical analysis leverages heterogeneous data to distinguish signal from artifacts by identifying what signal is shared by some or all of the studies, and what signal is specific to an individual study. The unsupervised identification of latent factors can be particularly useful for uncovering signal in the high-dimensional setting, but existing extensions of factor analysis to the multi-study context can only identify latent factors if they are common to all studies or unique to a single study. In this work, we introduce Bayesian Combinatorial Multi-Study Factor Analysis (BCMSFA), which learns latent factors shared by any subset of studies. We do so by using the Indian Buffet Process to model the shared ownership of factors across multiple studies. Our approach encourages sparse high-dimensional factor loading matrices through the multiplicative gamma process shrinkage prior. We estimate parameters using a computationally efficient Gibbs sampling algorithm. We demonstrate the robustness of BCMSFA through a broad range of simulations, and apply BCMSFA to multiple breast cancer gene expression datasets.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program