Abstract Details

Activity Number: 294 - Epidemiologic Methods for the Re-Use of Existing Data
Type: Topic Contributed
Date/Time: Tuesday, July 31, 2018 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistics in Epidemiology
Abstract #328579 Presentation
Title: Genetic Association Testing with Imperfect Phenotypes Derived from Electronic Health Records
Author(s): Jennifer Sinnott*
Companies: Ohio State University
Keywords: electronic health records; electronic medical records; genetic association test; phenome-wide association study; PheWAS
Abstract:

Electronic health records (EHRs) linked to blood samples form a powerful data resource for testing associations between genotypes and phenotypes, provided that accurate phenotype information can be extracted from the records. Some existing strategies require validation sets with "gold standard" phenotypes, but these are time-consuming to create, which becomes prohibitive when many phenotypes are of interest, as in phenome-wide association studies (PheWASs). Other strategies identify cases by thresholding counts of billing codes related to each disease; these strategies are rapid but produce inaccurate phenotypes, which may compromise statistical power. We propose a new method for genetic association testing in this setting that better leverages the information in the billing code counts. The method uses unsupervised clustering to separate patients into two groups based on their diagnosis codes, and each subject is then assigned a probability of being a disease case based on that clustering. The method is rapid and can improve power to detect known associations relative to the standard thresholding-based methods.
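
As a rough illustration of the idea of clustering billing code counts and carrying a case probability into the association test, the sketch below fits a two-component Poisson mixture by EM and then regresses the resulting probabilities on genotype. The abstract does not specify the clustering model or the test statistic, so the Poisson mixture, the least-squares test, the function names, and the toy data are all assumptions made for illustration and should not be read as the author's method.

```python
import math


def poisson_logpmf(k, lam):
    """Log of the Poisson probability mass at count k with mean lam."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)


def fit_two_poisson_mixture(counts, n_iter=200):
    """EM for a two-component Poisson mixture (an assumed clustering model).

    Returns (pi_case, lam_low, lam_high, post), where post[i] is the
    posterior probability that subject i belongs to the high-count group.
    """
    n = len(counts)
    mean = sum(counts) / n
    lam_low, lam_high = max(0.5 * mean, 0.1), 2.0 * mean + 1.0
    pi_case = 0.5
    post = [0.5] * n
    for _ in range(n_iter):
        # E-step: posterior probability of the high-count ("case") component.
        post = []
        for k in counts:
            a = math.log(pi_case) + poisson_logpmf(k, lam_high)
            b = math.log(1.0 - pi_case) + poisson_logpmf(k, lam_low)
            m = max(a, b)
            ea, eb = math.exp(a - m), math.exp(b - m)
            post.append(ea / (ea + eb))
        # M-step: update the mixing weight and the two Poisson means.
        w = sum(post)
        pi_case = min(max(w / n, 1e-6), 1.0 - 1e-6)
        lam_high = max(sum(p * k for p, k in zip(post, counts)) / max(w, 1e-12), 1e-6)
        lam_low = max(sum((1 - p) * k for p, k in zip(post, counts)) / max(n - w, 1e-12), 1e-6)
    if lam_low > lam_high:
        # Keep the "case" label on the component with the larger mean.
        lam_low, lam_high = lam_high, lam_low
        pi_case = 1.0 - pi_case
        post = [1.0 - p for p in post]
    return pi_case, lam_low, lam_high, post


def slope_and_t(genotypes, probs):
    """Least-squares slope of case probability on genotype, with its t-statistic."""
    n = len(genotypes)
    g_bar = sum(genotypes) / n
    p_bar = sum(probs) / n
    sxx = sum((g - g_bar) ** 2 for g in genotypes)
    sxy = sum((g - g_bar) * (p - p_bar) for g, p in zip(genotypes, probs))
    beta = sxy / sxx
    rss = sum((p - p_bar - beta * (g - g_bar)) ** 2 for g, p in zip(genotypes, probs))
    se = math.sqrt(rss / (n - 2) / sxx)
    return beta, beta / se


# Hypothetical toy data: billing code counts for one disease, and
# minor-allele counts (0, 1, or 2) at one variant, for 16 subjects.
counts = [0, 0, 1, 0, 2, 1, 0, 0, 9, 12, 8, 11, 0, 1, 10, 0]
genotypes = [0, 0, 0, 1, 0, 0, 1, 0, 2, 1, 1, 2, 0, 0, 1, 0]

pi_case, lam_low, lam_high, probs = fit_two_poisson_mixture(counts)
beta, t_stat = slope_and_t(genotypes, probs)
print("estimated case fraction:", round(pi_case, 3))
print("slope:", round(beta, 3), "t-statistic:", round(t_stat, 2))
```

Using the probabilities as a continuous outcome, rather than a 0/1 label from a fixed count threshold, keeps subjects with intermediate counts from being forced into one group; a real analysis would also need covariate adjustment and a properly calibrated test.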


Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program