Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 18 - Emerging Issues in EHR: Methods to Enhance Knowledge Discovery and Real World Implementation
Type: Invited
Date/Time: Monday, August 3, 2020 : 10:00 AM to 11:50 AM
Sponsor: Section on Statistics in Epidemiology
Abstract #309553
Title: Modeling Heterogeneity and Missing Data in Electronic Health Records
Author(s): Ying Wei*
Companies: Columbia University

Electronic health records are increasingly adopted in US health systems. A natural feature of EHR data is unobserved, or "latent" heterogeneity, whereby unobservable subgroups of patients are characterized by distinctive patterning in their longitudinal health trajectories. Researchers have used growth mixture models to analyze latent heterogeneity in longitudinal data. One of the primary challenges is to handle the large numbers of missing data in EHR, which are informative and associated with patient's underlying health status. To address this issue, we propose a Bayesian shared parameter model to model latent heterogeneity in multiple longitudinal health outcomes in EHRs, while accounting for MNAR missing data mechanisms for the visit process and response process given a clinic visit. An MCMC algorithm is designed to estimate the proposed model. We evaluated the performance of proposed model in simulation studies as well as a real EHR data, and showed clear advantages in comparison to a naive GMM model with completely observed data only, and the one adjusting missing data with the MAR assumption. This is joint work with Rebecca Anthopolos and Qixuan Chen.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program