Abstract:
|
In medical research, identifying clinically relevant patient subgroups is an important step toward personalized medicine. Integrative clustering is often used to identify patient subgroups by joint modeling datasets collected from multiple sources describing the same set of individuals. Simultaneously clustering multi-source data allows more information to be incorporated into the clustering process and could generate deeper biological insight regarding disease heterogeneity. Motivated by a birth cohort study, we propose a consensus clustering model for integrating multi-source longitudinal data with distinct data structures and variable types. An efficient Gibbs sampling algorithm is developed to estimate the model parameters and the number of clusters. Analysis of real and simulated data will be presented and discussed.
|