Abstract:
|
The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. Existing association tests often suffer from insufficient power due to the substantially increased burden of multiple testing and correspondingly stringent multiple testing corrections that do not appropriately account for correlations. Using population-level whole-genome sequencing data, we propose a new statistical method to detect and localize rare and common risk variants based on a recently developed knockoff framework. The proposed approach includes a novel sequential knockoff generator for both rare and common variants in whole-genome sequencing studies, paired with a powerful screening method to comprehensively analyze rare and common variants across the genome. The proposed knockoff generator is scalable to large whole-genome sequencing datasets, and the screening method is powerful in detecting signals across the genome with guaranteed false discovery rate control. Furthermore, it can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help dist
|