
Abstract Details

Activity Number: 390 - Challenges in Whole-Genome Sequence Analysis: Experiences and Approaches in the TOPMed Project
Type: Invited
Date/Time: Tuesday, August 1, 2017 : 2:00 PM to 3:50 PM
Sponsor: WNAR
Abstract #322378
Title: FastSKAT: Sequence Kernel Association Tests for Very Large Sets of Markers in Large-Scale Sequencing Data
Author(s): Kenneth Rice*
Companies: University of Washington
Abstract:

The Sequence Kernel Association Test (SKAT) is widely used to test for associations between a phenotype and a set of variants. Computing p-values for SKAT requires the eigenvalues of the genotype covariance matrix, or of a similar matrix of equal size: an n × n matrix, where n is the smaller of the number of subjects and the number of variants. Extracting the full set of eigenvalues has computational cost proportional to n^3, which currently limits the use of SKAT. To overcome this, we propose fastSKAT, a new computationally efficient but accurate approximation, in which only the k largest eigenvalues for SKAT are extracted and a remainder term is evaluated using a Satterthwaite approach. For sample sizes seen in current sequencing studies, these innovations make SKAT tests feasible with at least an order of magnitude more variants than current approaches. We illustrate fastSKAT on several large datasets, describing its computational stability, its accuracy in terms of Type I error rates, and its computational speed. We show that fastSKAT quickly and accurately implements SKAT analyses for large numbers of markers, and illustrate how, used with sequence data, it will help address questions that were previously intractable.
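
As a rough illustration of the remainder term, the sketch below applies standard Satterthwaite moment matching: the eigenvalues beyond the k largest are replaced by a single scaled chi-square variable with the same mean and variance. It is a minimal sketch, not the fastSKAT implementation. The function name `satterthwaite_remainder` and its arguments are hypothetical, and it assumes the k largest eigenvalues, the trace of the matrix, and the trace of its square are already available; the abstract does not say how those quantities are obtained.

```python
def satterthwaite_remainder(top_eigs, trace, trace_sq):
    """Moment-match the eigenvalues not in top_eigs to scale * chi-square(df).

    Hypothetical helper for illustration only.
    top_eigs : the k largest eigenvalues (assumed already computed)
    trace    : sum of all eigenvalues, i.e. the trace of the matrix
    trace_sq : sum of all squared eigenvalues, i.e. the trace of the matrix squared
    """
    # Sum and sum of squares of the eigenvalues that were not extracted.
    rem_sum = trace - sum(top_eigs)
    rem_sq = trace_sq - sum(e * e for e in top_eigs)
    if rem_sum <= 0 or rem_sq <= 0:
        raise ValueError("no positive remainder left to approximate")

    # The remainder sum(lambda_i * chi2_1) has mean rem_sum and variance 2 * rem_sq.
    # scale * chi2_df has mean scale * df and variance 2 * scale**2 * df.
    # Equating the two pairs of moments gives:
    scale = rem_sq / rem_sum
    df = rem_sum * rem_sum / rem_sq
    return scale, df


# Toy spectrum (5, 3, 1, 1, 1, 1) with k = 2: the four remaining eigenvalues
# are all equal to 1, so the remainder is exactly a chi-square with 4 df.
scale, df = satterthwaite_remainder([5.0, 3.0], trace=12.0, trace_sq=38.0)
```

Under this approximation the null distribution of the test statistic is treated as the weighted sum of k chi-square(1) variables, one per extracted eigenvalue, plus the single scaled chi-square term, so only k eigenvalues need to be computed.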


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program
