Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 320 - Electronic Health Records, Causal Inference and Miscellaneous
Type: Contributed
Date/Time: Wednesday, August 11, 2021 : 3:30 PM to 5:20 PM
Sponsor: Section on Statistics in Epidemiology
Abstract #318942
Title: Double Sampling for Data Missing Not at Random: Designs and Efficient Estimation Strategies
Author(s): Alexander Levis* and Sebastien Haneuse
Companies: Harvard T.H. Chan School of Public Health and Harvard TH Chan School of Public Health
Keywords: missing data; study design; double sampling; causal inference; MNAR

Large observational studies derived from electronic health record (EHR) data are increasingly being used for comparative effectiveness research. Though these data have many advantages, investigators must acknowledge and handle a typically substantial amount of missing data. Most existing methods for missing data focus on identification and estimation of parameters of interest when data are missing at random, however this assumption is likely untenable in EHR data for which the missingness process is complex and poorly understood. We consider a double sampling design in which a subsample of subjects with initially missing data are more intensively followed up to obtain complete information. We discuss scenarios and assumptions under which the joint density of interest is identified in the augmented sample. Further, we present semiparametric efficient and multiply robust estimators of causal average treatment effects when outcome data are initially missing not at random. Finally, we demonstrate our statistical approach, as well as the practical feasibility of the design, in an EHR-based analysis of weight outcomes following bariatric surgery.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program