JSM 2017 Online Program

Abstract Details

Activity Number:	464 - Missing Data
Type:	Contributed
Date/Time:	Wednesday, August 2, 2017 : 8:30 AM to 10:20 AM
Sponsor:	Biometrics Section
Abstract #324492
Title:	How Many Validation Samples Are Needed for Phenotyping Error Correction in EHR-Based Association Studies?
Author(s):	Jing Huang* and Rui Duan and Rebecca Hubbard and Hua Xu and Yong Chen
Companies:	University of Pennsylvania and University of Pennsylvania and University of Pennsylvania and The University of Texas Health Science Center at Houston and University of Pennsylvania
Keywords:	association study ; bias correction ; EHR ; misclassification ; phenotyping ; validation
Abstract:	In research based on electronic health records (EHR), health status ascertained by phenotyping algorithms is sometimes error-prone. Ignoring misclassification in EHR-derived phenotypes can lead to biased estimates of the effect sizes of risk factors. To correct for such bias, manual chart reviews are usually conducted to obtain the true health status for a small validation set. Current methods that use the validation data for bias correction include direct estimation of misclassification rates and joint modeling of the validated and unvalidated data. There is a lack of guidance on which method performs better under which scenarios. In this talk, we compare the relative performance of these two commonly used procedures under various scenarios motivated by real applications, and we also propose two additional methods that can effectively incorporate knowledge of misclassification rates into the bias correction. Simulation studies and case studies will be presented to shed light on how to choose the validation sample size in practical EHR-based investigations.
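
To make the idea of "direct estimation of misclassification rates" concrete, the following is a minimal sketch and not the authors' method. It assumes a binary phenotype, a binary risk factor, and nondifferential misclassification (the same sensitivity and specificity in both exposure groups). All function names, counts, and prevalences are hypothetical and chosen only for illustration. Sensitivity and specificity are estimated from a chart-reviewed validation set, and the observed phenotype prevalences are then corrected with the standard formula p_true = (p_obs + spec - 1) / (sens + spec - 1).

```python
def estimate_rates(validation):
    """Estimate (sensitivity, specificity) from (true_status, algorithm_status) pairs."""
    tp = sum(1 for t, a in validation if t == 1 and a == 1)
    fn = sum(1 for t, a in validation if t == 1 and a == 0)
    tn = sum(1 for t, a in validation if t == 0 and a == 0)
    fp = sum(1 for t, a in validation if t == 0 and a == 1)
    return tp / (tp + fn), tn / (tn + fp)


def corrected_prevalence(p_obs, sens, spec):
    """Back out the true prevalence from the algorithm-derived prevalence."""
    return (p_obs + spec - 1.0) / (sens + spec - 1.0)


def odds_ratio(p_exposed, p_unexposed):
    """Odds ratio comparing phenotype prevalence in exposed vs. unexposed."""
    return (p_exposed / (1.0 - p_exposed)) / (p_unexposed / (1.0 - p_unexposed))


# Hypothetical validation set of 200 chart-reviewed records:
# 100 true cases (80 flagged by the algorithm), 100 true non-cases (5 flagged).
validation = [(1, 1)] * 80 + [(1, 0)] * 20 + [(0, 0)] * 95 + [(0, 1)] * 5
sens, spec = estimate_rates(validation)

# Hypothetical algorithm-derived prevalences in the unvalidated data.
p_obs_exposed, p_obs_unexposed = 0.275, 0.125

naive_or = odds_ratio(p_obs_exposed, p_obs_unexposed)
corrected_or = odds_ratio(
    corrected_prevalence(p_obs_exposed, sens, spec),
    corrected_prevalence(p_obs_unexposed, sens, spec),
)

print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
print(f"naive OR={naive_or:.3f}, corrected OR={corrected_or:.3f}")
```

In this toy setting the naive odds ratio is attenuated toward 1 relative to the corrected one. The sketch ignores the sampling variability of the estimated sensitivity and specificity, which shrinks as the validation set grows and is the reason the validation sample size matters.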

Authors who are presenting talks have a * after their name.
