JSM 2005 - Toronto

Abstract #303052

This is the preliminary program for the 2005 Joint Statistical Meetings in Minneapolis, Minnesota. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2005); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.



The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


The Program has labeled the meeting rooms with "letters" preceding the name of the room, designating in which facility the room is located:

Minneapolis Convention Center = “MCC” Hilton Minneapolis Hotel = “H” Hyatt Regency Minneapolis = “HY”

Back to main JSM 2005 Program page



Legend: = Applied Session, = Theme Session, = Presenter
Activity Number: 51
Type: Invited
Date/Time: Sunday, August 7, 2005 : 4:00 PM to 5:50 PM
Sponsor: Biometrics Section
Abstract - #303052
Title: Variance Estimation of Crossvalidation Estimators of the Generalization Error
Author(s): Hong Tian*+ and Marianthi Markatou and Shameek Biswas and George Hripcsak
Companies: Columbia University and Columbia University and Columbia University and Columbia University
Address: 154 Haven Ave., New York, NY, 10032,
Keywords: cross-validation ; generalization error ; moment approximation ; prediction ; variance estimation
Abstract:

An important aspect of algorithmic performance is the generalization error. The estimation of the generalization error is usually done via bootstrap or crossvalidation. However, providing an estimator of the variance of the crossvalidation (CV) estimator of the generalization error is a more difficult problem, particularly various sources of variability are taken into account. We look at the problem of variance estimation of the CV estimators of generalization error as a problem in approximating the moments of a statistic. The approximation illustrates the role of training and test sets in the performance of the algorithm. It provides a unifying approach to evaluation of various methods used in obtaining training and test sets. It takes into account the variability due to different training and test sets. For the simple problem of predicting the sample mean and in the case of smooth loss functions, we derive our moment approximation estimator and compare it with the one proposed by Nadeau and Bengio (2003). We extend these results in the regression case and the case of absolute error loss. We illustrate the results through simulation.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2005 program

JSM 2005 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2005