Online Program

Return to main conference page

All Times ET

Friday, June 4
Computational Statistics
New Models and Methods
Fri, Jun 4, 1:20 PM - 2:55 PM
TBD
 

Quantifying the Terms from the Bias-Variance Decomposition Using Metalog Distributions (309704)

*Neil A Hamlett, Uncertainty Research, LLC 

Keywords: machine learning, mathematical statistics, probability distributions

The bias-variance decomposition (BVD) serves as a key guidepost in machine-learning practice. Comparing model scores for training, validation, and test data provide qualitative indications of possible model overfitting. Explicit calculation of BVD terms is generally problematic, absent assumptions about convenient probability distributions.

Metalog distributions, an emerging technique from the Decision-Analysis community, provide a mechanism for explicit calculation of BVD terms. Metalogs fit arbitrary empirical distributions to closed-form, multiply-differentiable, continuous functions represented as series.

The bias and variance BVD terms result from metalog-distribution fits to prediction-model training- and test-data residual errors employing a method resembling Locally Weighted Scatterplot Smoothing (LOWESS). Instead of calculating means as is done with LOWESS, metalog-fit probability-density functions are instead obtained. This allows for some statistical inference using machine-learning results, including estimates of prediction and confidence intervals. Applying Kolmogorov-Smirnov or other tests to the resulting bias and variance distributions leads to a quantitative characterization of prediction-model overfitting.

Metalog distributions can similarly be employed to estimate Cramer-Row Lower Bounds (CRLB) on estimation-error variance. This occurs from fitting orthogonal-transform-space representations of the explanatory variables to metalog distributions. The CLRB offers a proxy for the irreducible-variance term in the BVD. Explicit calculation of BVD terms using metalog distributions affords the opportunity to extend Analysis of Variance (ANOVA) methods from classical statistics into the machine-learning realm.