Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 25 - Medical Devices and Diagnostics Speed Session
Type: Contributed
Date/Time: Sunday, August 8, 2021 : 1:30 PM to 3:20 PM
Sponsor: Section on Medical Devices and Diagnostics
Abstract #317694
Title: Feature Space Fusion for Heterogeneous Scattered Data
Author(s): Jingyi Zhang* and Wenxuan Zhong and Ping Ma
Companies: Center for Statistical Science, Tsinghua University and University of Georgia and University of Georgia
Keywords: Data fusion; Feature space fusion; Data heterogeneity; Dimension reduction; Multi-index model
Abstract:

Scattered data or multi-center data, which are collected and stored individually at local data centers, can be highly heterogeneous if data centers are very different. Clearly, a simple collating or pooling of data is not enough, sometimes even not feasible due to data privacy and bandwidth limitation. There is an urgent need for data fusion methods to integrate scattered data. We present a general feature space fusion framework through the multi-index model, which assumes that the response variable depends on several linear combinations of predictors through some unknown link functions. By fusing the feature space spanned by the regression indices for data in each center, we can borrow the strength of multiple local centers, and obtain a more accurate estimation. We show theoretically that the fused feature space is asymptotically consistent under some mild regularity conditions. We also establish the asymptotic convergence rate of the proposed algorithm. As we allow center-specific predictor distributions and link functions for local data centers, the method can well address the data heterogeneity in scattered data.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program