Abstract:
|
In modern machine learning, it often makes no sense to assume that a model for Y|X is well-specified. But there are some settings in genetics and neuroscience where modeling the covariates X might be reasonable instead. We discuss the MX(2) model, where we know the first two moments of X, and discuss how a prediction-based perspective can neatly interface with classical semiparametric theory to deliver an interpretable notion of variable importance for all variables, along with valid confidence intervals and controlled false discovery rate.
|