All Times EDT

Abstract Details

Activity Number: 28 - SPEED: Statistical Computing and Statistics in Genomics Part 1
Type: Contributed
Date/Time: Sunday, August 7, 2022 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #323285
Title: Flexible Non-Parametric Tests of Sample Exchangeability and Feature Independence
Author(s): Alan Aw* and Yun Song and Jeffrey Spence
Companies: University of California, Berkeley and University of California, Berkeley and Stanford University
Keywords: exchangeability; feature independence; non-parametric test; population stratification; single-cell ATAC-seq; World Values Survey

In scientific studies that analyze multivariate data, two questions commonly arise: whether the sample is exchangeable, meaning that the joint distribution of the sample is invariant to the ordering of the units, and whether the features can be grouped so that the groups are mutually independent. We propose a non-parametric approach that addresses both questions. Our approach is conceptually simple, fast, and flexible. It controls the Type I error across realistic scenarios and handles data of arbitrary dimensions by leveraging large-sample asymptotics. In the exchangeability detection setting, extensive simulations and a comparison against unsupervised tests of stratification based on random matrix theory show that our approach compares favorably in various scenarios of interest. We apply our method to genomic questions such as identifying optimal linkage disequilibrium (LD) blocks and identifying panmictic populations. We also apply our approach to post-clustering single-cell chromatin accessibility data and to World Values Survey data, where we show how users can divide features into independent groups, which helps generate new scientific hypotheses about the features.
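
To make the exchangeability question concrete, here is a minimal sketch of a generic permutation test. It is not the method of this abstract, which relies on large-sample asymptotics and whose test statistic is not given here. The sketch assumes the features are independent, so that under the null hypothesis shuffling each feature separately across the units leaves the joint distribution unchanged. The statistic (the variance of pairwise squared distances) and all function names are illustrative assumptions.

```python
import random
from itertools import combinations
from statistics import pvariance


def pairwise_distance_variance(rows):
    """Variance of squared Euclidean distances over all pairs of units."""
    dists = [
        sum((a - b) ** 2 for a, b in zip(r, s))
        for r, s in combinations(rows, 2)
    ]
    return pvariance(dists)


def permute_within_features(rows, rng):
    """Shuffle each feature (column) independently across the units."""
    columns = [list(col) for col in zip(*rows)]
    for col in columns:
        rng.shuffle(col)
    return [list(row) for row in zip(*columns)]


def exchangeability_permutation_test(rows, n_perm=200, seed=0):
    """Return a permutation p-value for the hypothetical statistic above.

    Assumption (not from the abstract): features are independent, so
    column-wise shuffling generates draws from the null distribution.
    Stratified samples tend to inflate the statistic, so large observed
    values count as evidence against exchangeability.
    """
    rng = random.Random(seed)
    observed = pairwise_distance_variance(rows)
    exceed = sum(
        pairwise_distance_variance(permute_within_features(rows, rng)) >= observed
        for _ in range(n_perm)
    )
    # Add-one correction keeps the p-value strictly positive.
    return (1 + exceed) / (n_perm + 1)
```

For example, passing a list of 30 units with 20 features each, where half the units come from a population with shifted feature means, should yield a small p-value. A homogeneous sample should yield a p-value that is roughly uniform on (0, 1].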

Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program