Online Program Home
My Program

Abstract Details

Activity Number: 140 - Frontiers of Statistical Genetics: Genomics, Transcriptomics, and PheWAS
Type: Invited
Date/Time: Monday, July 29, 2019 : 10:30 AM to 12:20 PM
Sponsor: WNAR
Abstract #300557 Presentation
Title: Predictive Modeling of Transcriptomics in Ancestrally Diverse Populations
Author(s): Timothy Thornton* and Anya Mikhaylova
Companies: University of Washington and Universtiy of Washington
Keywords: Prediction; Supervised Learning; Transcriptomics; Ancestry; Machine Learning

Predictive models of gene expression, or transcriptomics, from genotyping or sequencing data are now widely used for the identification of genes involved in complex traits. One of the most popular transciptomic prediction methods is PrediXcan, where predictions are derived from supervised machine learning algorithms, such as LASSO and elastic net. PrediXcan models used training data from subjects with European ancestry, however, many genetic studies includes samples from ancestrally diverse populations. Using transcriptomic data from the GEUVADIS (Genetic European Variation in Health and Disease) RNA sequencing project and whole genome sequencing data from the 1000 Genomes project, we evaluate and compare the predictive performance of PrediXcan in diverse populations. We show that predictive performance varies across populations, with the Yoruban (YRI) sample from Nigeria having the lowest correlations between the observed and predicted gene expression values, on average. We also propose a new approach for modeling gene expression that incorporates ancestry in order to improve prediction accuracy in diverse populations, including populations with admixed ancestry.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program