Abstract Details

Activity Number: 179 - Statistical Methods for Measurement Error and Missing Data in Covariates/Exposures
Type: Contributed
Date/Time: Monday, July 29, 2019 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistics in Epidemiology
Abstract #306463
Title: Design and Analysis of Two-Phase Samples in Discrete-Time Survival Analysis with Error-Prone Exposures
Author(s): Kyunghee Han* and Thomas Lumley and Bryan E Shepherd and Pamela Shaw
Companies: University of Pennsylvania and University of Auckland and Vanderbilt University School of Medicine and University of Pennsylvania
Keywords: Auxiliary variables; Discrete failure time; Mean score; Measurement error; Surrogate data; Two-phase sampling

Medical research increasingly depends on data collected for non-research purposes, such as electronic health records (EHR). EHR data and other large databases can be prone to measurement error in key exposures. Validating a subset of records is a cost-effective way to learn about the error structure, which in turn can be used to adjust analyses for this error and improve inference. We extend the mean score method to the two-phase analysis of discrete-time survival models, using the unvalidated covariates as auxiliary variables that act as surrogates for the unobserved true exposure. This approach retains the consistency of the regression estimates obtained from the validated subset while gaining precision from the auxiliary data. Further, we develop optimal sampling strategies that minimize the variance of the mean score estimator for a target exposure under a fixed cost constraint. Through simulations, we evaluate the efficiency gains of the mean score estimator under optimal validation designs relative to random sampling. We also apply the proposed method to the Wilms tumor study.
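As a rough illustration of the general idea, and not of the authors' extension: when the phase-one data (outcome and surrogate exposure) are discrete, the standard mean score estimator can be computed as a weighted analysis of the validated subset, where each validated record in a stratum is weighted by the stratum's total count divided by its validated count. The minimal Python sketch below computes such weights. The function name `mean_score_weights`, the stratum labels, and the toy data are hypothetical and chosen only for illustration; how strata are defined for discrete-time survival data (for example by event status, interval, and surrogate exposure) is an assumption not specified in the abstract.

```python
from collections import Counter


def mean_score_weights(strata, validated):
    """Return one weight per record.

    strata    : sequence of hashable labels built from phase-one data
                (assumed here: outcome combined with the surrogate exposure).
    validated : sequence of booleans, True if the record was validated.

    Validated records get N_s / n_s, where N_s is the number of records in
    stratum s and n_s is the number of validated records in that stratum.
    Unvalidated records get weight 0.0.
    """
    if len(strata) != len(validated):
        raise ValueError("strata and validated must have the same length")

    total = Counter(strata)
    n_valid = Counter(s for s, v in zip(strata, validated) if v)

    # Every stratum needs at least one validated record, otherwise the
    # stratum-specific contribution cannot be estimated.
    empty = [s for s in total if n_valid[s] == 0]
    if empty:
        raise ValueError(f"no validated records in strata: {empty}")

    return [total[s] / n_valid[s] if v else 0.0
            for s, v in zip(strata, validated)]


# Toy data: two strata, three of six records validated.
strata = ["a", "a", "a", "a", "b", "b"]
validated = [True, True, False, False, True, False]
weights = mean_score_weights(strata, validated)
```

The weights of the validated records sum to the full phase-one sample size, which is what lets the validated subset stand in for the whole cohort. In practice these weights would be passed to a weighted fit of the discrete-time survival model on the validated records; the choice of model and software is left open here.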

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program