Online Program Home
My Program

Abstract Details

Activity Number: 85 - SPEED: An Ensemble of Advances in Genomics and Genetics
Type: Contributed
Date/Time: Sunday, July 29, 2018 : 5:05 PM to 5:50 PM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #332709
Title: Discrete Principal Component Analysis for Population Stratification
Author(s): Nedret Billor* and Yuan Yuan and Asuman Seda Turkmen
Companies: Auburn University and Auburn University and The Ohio State University
Keywords: Logistic PCA; Similarity measure; clustering; population structure; rare variants

The computational simplicity of principal component analysis (PCA) makes it a widely used method for population stratification adjustment. However, given that categorical nature of genotype data, it is not appropriate to directly apply PCA, designed specifically for continuous variables, on genotype data. In addition, although common variants have been extensively studied, little is known about the stratification of rare variants and its impact on association tests. The fact that rare variants are not stratified in the same way as common variants necessitates the development of statistical methods that can capture stratification patterns for low-frequency and rare variants. To address these limitations, we investigate performances of categorical PCA and similarity-matrix based PCA which might be able to detect underlying structures for rare variants. We demonstrate, through simulated and real data sets, that similarity-matrix based PCA is able to adjust for population stratification in rare variants much more effectively than does standard and categorical PCA.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program