Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 91 - High Dimensional Data, Causal Inference, Biostats Education, and More
Type: Contributed
Date/Time: Monday, August 9, 2021 : 10:00 AM to 11:50 AM
Sponsor: ENAR
Abstract #318969
Title: Multisite Learning with High-Dimensional Data via a Distributed Algorithm for Penalized Regression (ADAP)
Author(s): Xiaokang Liu* and Rui Duan and Chongliang Luo and Alexis Ogdie and Jason H. Moore and Henry R. Kranzler and Jiang Bian and Yong Chen
Companies: University of Pennsylvania and Harvard University and University of Pennsylvania and University of Pennsylvania and University of Pennsylvania and University of Pennsylvania and University of Florida Health Cancer Center and University of Pennsylvania
Keywords: Data integration; Distributed algorithm; Electronic health record; Penalized regression
Abstract:

Electronic health records (EHR) contain rich information about patients’ diagnoses, lab tests, and medications, etc., and the wide adoption of EHRs throughout the United States facilitates data integration among different institutions to enrich the study population in biomedical research and improve statistical power. To this end, we propose a one-shot summary-statistics-based distributed algorithm for fitting penalized generalized linear model in multicenter research networks based on patient-level data from a leading site and summary-level statistics from other participating sites. This method only requires one round of communication of summary statistics and avoids transferring patient-level data. Taking the logistic lasso regression as an example, we evaluate the performance of the proposed method in terms of estimation, prediction, and feature selection using both simulation studies and a real-world application.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program