All Times EDT

Abstract Details

Activity Number: 413 - Analyses of Environmental Data
Type: Contributed
Date/Time: Thursday, August 12, 2021 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistics and the Environment
Abstract #319010
Title: Prediction and Model Evaluation for Space-Time Data
Author(s): Gregory Watson* and Donatello Telesca
Companies: UCLA and UCLA
Keywords: Prediction; Cross-Validation; Space-Time; Spatiotemporal; Interpolation; Error
Abstract:

Evaluation metrics for prediction error, model selection, and model averaging on space-time data are understudied and poorly understood. The absence of independent replication makes prediction ambiguous as a concept and renders evaluation procedures developed for independent data inappropriate for most space-time prediction problems. Motivated by air pollution data collected during the 2008 California wildfires, we attempt a formalization of the true prediction error associated with spatial interpolation. We investigate a variety of cross-validation (CV) procedures, using both simulations and case studies, to provide insight into the estimand targeted by alternative data-partitioning strategies. Consistent with recent best practice, we find that location-based cross-validation is appropriate for estimating spatial interpolation error, as in our analysis of the California wildfire data. Interestingly, commonly held notions about the bias-variance trade-off in CV fold size do not carry over trivially to dependent data, and we recommend leave-one-location-out (LOLO) CV as the preferred prediction error metric for spatial interpolation.
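
As a rough illustration of the leave-one-location-out (LOLO) partition named in the abstract, the sketch below holds out every observation from one monitoring location at a time and predicts it from the remaining locations. Everything in it is an assumption made for illustration: the simulated data, the function names `idw_predict` and `lolo_cv_mse`, and the inverse-distance-weighted interpolator, which is only a stand-in predictor and not the authors' model.

```python
import numpy as np


def idw_predict(train_xy, train_y, test_xy, power=2.0):
    """Inverse-distance-weighted interpolation (hypothetical stand-in predictor).

    train_xy: (n_train, 2) coordinates, train_y: (n_train, n_time) values,
    test_xy: (n_test, 2) coordinates. Returns (n_test, n_time) predictions.
    """
    # Pairwise distances between test and training locations.
    d = np.linalg.norm(test_xy[:, None, :] - train_xy[None, :, :], axis=2)
    w = 1.0 / np.maximum(d, 1e-12) ** power
    w /= w.sum(axis=1, keepdims=True)  # weights sum to 1 for each test point
    return w @ train_y


def lolo_cv_mse(coords, y):
    """Leave-one-location-out CV: per-location mean squared error.

    coords: (n_loc, 2); y: (n_loc, n_time). Each fold removes ALL time
    points of one location, so the fold mimics predicting at an
    unmonitored site.
    """
    n_loc = coords.shape[0]
    errors = np.empty(n_loc)
    for i in range(n_loc):
        keep = np.arange(n_loc) != i
        pred = idw_predict(coords[keep], y[keep], coords[i:i + 1])
        errors[i] = np.mean((pred[0] - y[i]) ** 2)
    return errors


# Simulated data (an assumption, not the wildfire data from the abstract).
rng = np.random.default_rng(0)
coords = rng.uniform(0.0, 1.0, size=(25, 2))
times = np.arange(10)
signal = np.sin(2 * np.pi * coords[:, :1]) + 0.1 * times[None, :]
y = signal + 0.1 * rng.standard_normal(signal.shape)

per_location_mse = lolo_cv_mse(coords, y)
lolo_estimate = per_location_mse.mean()
print(f"LOLO CV estimate of interpolation MSE: {lolo_estimate:.4f}")
```

Holding out whole locations, rather than randomly chosen individual observations, keeps a site's own time series out of the training set, which is what makes the fold resemble interpolation to a new site.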


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program