JSM 2013 Home
Online Program Home
My Program

Abstract Details

Activity Number: 627
Type: Topic Contributed
Date/Time: Thursday, August 8, 2013 : 8:30 AM to 10:20 AM
Sponsor: Quality and Productivity Section
Abstract - #309976
Title: Density-Based Partitioning for K-Fold Cross-Validation
Author(s): Lulu Kang*+
Companies: Illinois Institute of Technology-Department of Applied Mathematics
Keywords: Cross Validation ; K-fold ; kernel density estimation ; optimal band-width
Abstract:

Cross-validation, sometimes called rotation estimation, is a technique for assessing how the results of a statistical analysis will generalize to an independent data set. K-fold cross-validation is the most commonly used cross-validation method. Typically in K-fold cross-validation, the original sample is randomly partitioned into K sub-samples. In this paper, we propose a new sampling method to partition the complete data set into K folds such that the probability distribution within each fold is similar to the probability distribution of the complete date set. The L2-norm of the difference between the empirical kernel density functions of the within-fold data and the complete data the measure is used here as objective function to search for the optimal partition of the K folds. Numerical examples are shown to compare the proposed cross-validation methods and the conventional ones.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2013 program




2013 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.