Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 91 - High Dimensional Data, Causal Inference, Biostats Education, and More
Type: Contributed
Date/Time: Monday, August 9, 2021 : 10:00 AM to 11:50 AM
Sponsor: ENAR
Abstract #319147
Title: Three-Phase Generalized Raking and Multiple Imputation Estimators to Address Errors in Routinely Collected Data
Author(s): Gustavo Amorim* and Ran Tao and Sarah Lotspeich and Pamela Shaw and Thomas Lumley and Rena Patel and Bryan Shepherd
Companies: Vanderbilt University Medical Center and Vanderbilt University Medical Center and Vanderbilt University Medical Center and Kaiser Permanente Washington Health Research Institute and University of Auckland and University of Washington and Department of Biostatistics, Vanderbilt University School of Medicine
Keywords: Design-based estimator; Electronic Medical Records; Measurement error; Model-based estimator; Multiple imputation; Three-phase design
Abstract:

Validation studies are often used to reduce measurement error and get more reliable information on certain variables of interest. These studies consist of selecting a sample of patients from which error-prone records had been collected previously, e.g. in an observational database, and performing either a more detailed measurement or more refined data collection procedure. In practice, however, more than one round of data validation may be required, and direct application of standard design-based or multiple imputation techniques may lead to estimators that are inefficient, as information available in intermediate validation steps are ignored or only partially considered. We present two novel extensions of generalized regression estimators and a multiple imputation technique that makes full use of all available data and show through simulations that incorporating information from intermediate steps may lead to substantial gains in efficiency. This is illustrated using electronic health record data from 85,324 HIV-positive women, of whom 5,080 had their charts reviewed, and then 1,285 also had a telephone interview to validate key variables for a study of contraceptive effectiveness


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program