Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 8 - Recent Advances in Statistical Learning for High-Dimensional and Heterogeneous Complex Data
Type: Invited
Date/Time: Monday, August 3, 2020 : 10:00 AM to 11:50 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #314446
Title: High-dimensional factor regression for heterogeneous subpopulations
Author(s): Peiyao Wang and Quefeng Li and Yufeng Liu* and Dinggang Shen
Companies: University of North Carolina at Chapel Hill and UNC Chapel Hill and University of North Carolina at Chapel Hill and University of North Carolina at Chapel Hill

In modern scientific research, data heterogeneity is commonly observed due to the abundance of complex data. We propose a factor regression model for data with heterogeneous subpopulations. In particular, the proposed model can be represented as a decomposition of heterogeneous and homogeneous terms. The heterogeneous term is driven by latent factors in different subpopulations. The homogeneous term captures common variation in the covariates and shares common regression coefficients across the subpopulations. Our proposed model attains a good balance between a global model and a group-specific model. The global model ignores data heterogeneity, while the group-specific model fits each subgroup separately. Both theoretical and numerical results are used to demonstrate the performance of the proposed model. Finally, analysis of a dataset from Alzheimer's Disease Neuroimaging Initiative further demonstrates the competitiveness and interpretability of our proposed factor regression model.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program