Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 319 - SLDS CSpeed 6
Type: Contributed
Date/Time: Wednesday, August 11, 2021 : 3:30 PM to 5:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #318621
Title: Controlled Group Variable Selection Using Variational Autoencoder-Generated Knockoffs and Reproducibility Evaluation
Author(s): Xinran Qi*
Companies: Medical College of Wisconsin
Keywords: Group-wise feature selection; False discovery rate control; Knockoffs; Generative model; Robustness
Abstract:

For genetic and genomic data, it is important to select representative single nucleotide polymorphism (SNP) blocks in which neighboring SNPs are correlated to predict survival outcomes and to construct biological pathways. In this case, controlling the familywise error rate is too restrictive and hence we will focus on the false discovery rate (FDR) control. We propose a generative model, a variational autoencoder, to generate knockoffs for controlled group variable selection. We also evaluate the reproducibility of the feature selection algorithm by sub-sampling and compare it with other alternatives. Simulations are used to show that the proposed method has comparatively low group FDR and high power. Finally, we apply the method to the 1000 Genomes Project data to select SNP blocks for the prediction of human leukocyte antigen allele haplotypes.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program