Abstract:
|
Regression diagnostics related to influence and leverage have been used for decades to evaluate the extremity of observations found within the space spanned by the predictor variables $X$ = ($X_1$,...,$X_q$). In many applications, the q-dimensional rectangular region formed by the ranges of the predictor variables includes subregions where combinations of the predictors are unlikely or even physically impossible to observe. For potential prediction locations found within such subregions, traditional regression diagnostics may fail to properly quantify the potential risks associated with prediction, particularly when the linear model for the relationship between the response and predictors contains polynomial terms that characterize nonlinear relationships.
In this talk, we consider a broader definition of extrapolation based on the inter-point distances among the points in the training data and we propose graphical and numerical approaches for assessing the potential for extrapolation. Methods are illustrated using data associated with the prediction of glacier melt rates.
|
ASA Meetings Department
732 North Washington Street, Alexandria, VA 22314
(703) 684-1221 • meetings@amstat.org
Copyright © American Statistical Association.