Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 61 - Population Inference with Electronic Health Records (EHRs) Data: Addressing Selection and Representativeness Bias
Type: Topic Contributed
Date/Time: Sunday, August 7, 2022 : 4:00 PM to 5:50 PM
Sponsor: Survey Research Methods Section
Abstract #323198
Title: Population Inference with Electronic Health Records (EHRs) Data: Addressing Selection and Representativeness Bias
Author(s): Yu B Chen* and Sarah Conderino* and Imaani Easthausen* and Emily Pfaff* and Judy Zhong* and Bo Cai*
Companies: CDC and NYU Grossman School of Medicine and Aetion and University of North Carolina at Chapel Hill and New York University Grossman School of Medicine and University of South Carolina, Arnold School of Public Health
Keywords: Electronic health records; public health surveillance; representativeness; misclassification; selection bias
Abstract:

Electronic health records (EHRs) are increasingly used for public health surveillance. As routinely collected data, EHRs offer a less expensive and fast alternative to national surveys and registries. However, use of EHR data to estimate finite population parameters such as disease prevalence requires great care. Data collected in EHRs are convenience samples that are not selected at random. Selection depends on various factors, including demographics, health status, and health care referral patterns. Moreover, conditional on inclusion in the EHR, data completeness and quality may influence the construction of the analysis dataset. To illuminate potential sources of selection bias, we describe the process of identifying an EHR cohort based on diagnosis codes and available encounter data. Relevant to surveillance, we probe challenges with capturing race and ethnicity data in EHRs, such as missing values and misclassification, which may result in misleading inferences. Finally, we present model and weight-based correction methods to address non-representativeness of the EHR sample with respect to the target population for which inference is desired.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program