Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 188 - SLDS Student Paper Awards
Type: Topic Contributed
Date/Time: Monday, August 8, 2022 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #322094
Title: Floodgate: Inference for Model-Free Variable Importance
Author(s): Lu Zhang* and Lucas Janson
Companies: Harvard University and Harvard University
Keywords: variable importance; effect size; model-X; heterogeneous treatment effects; heritability
Abstract:

Many modern applications seek to understand the relationship between an outcome variable Y and a covariate X in the presence of a confounding variable Z. Although much attention has been paid to testing whether Y depends on X given Z, in this paper we seek to go beyond testing by inferring the strength of that dependence. We first define our estimand, the minimum mean squared error (mMSE) gap, which quantifies the conditional relationship between Y and X in a way that is deterministic, model-free, interpretable, and sensitive to nonlinearities and interactions. We then propose a new approach called floodgate that can leverage any working regression function chosen by the user to construct asymptotic confidence bounds, and we apply it to the mMSE gap. In addition to proving floodgate's asymptotic validity, we rigorously quantify its accuracy and robustness. We then show we can apply the same floodgate principle to a different measure of variable importance when Y is binary. Finally, we demonstrate floodgate's performance in a series of simulations and apply it to data from the UK Biobank to infer the strengths of dependence of platelet count on various groups of genetic mutations.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program