All Times EDT

Abstract Details

Activity Number: 191 - Misspecification and Robustness: Novel Methods and Innovative Insights
Type: Topic Contributed
Date/Time: Monday, August 8, 2022 : 2:00 PM to 3:50 PM
Sponsor: International Society for Bayesian Analysis (ISBA)
Abstract #320879
Title: Bayesian Data Selection
Author(s): Eli Nathan Weinstein* and Jeffrey Miller
Companies: Columbia University and Harvard T.H. Chan School of Public Health
Keywords: nonparametrics; misspecification; robustness; Stein discrepancy

Insights into complex, high-dimensional data can be obtained by discovering features of the data that match or do not match a model of interest. To formalize this task, we introduce the "data selection" problem: finding a lower-dimensional statistic, such as a subset of variables, that is well fit by a given parametric model of interest. A Bayesian approach to data selection would be to parametrically model the value of the statistic, nonparametrically model the remaining "background" components of the data, and perform standard Bayesian model selection for the choice of statistic. However, fitting a nonparametric model to high-dimensional data tends to be highly inefficient. We propose a novel score for performing data selection, the "Stein volume criterion" (SVC), that does not require fitting a nonparametric model. We prove that the SVC is consistent for data selection. We apply the SVC to the analysis of single-cell RNA sequencing datasets.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program