JSM Preliminary Online Program
This is the preliminary program for the 2009 Joint Statistical Meetings in Washington, DC.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Activity Number: 357
Type: Topic Contributed
Date/Time: Tuesday, August 4, 2009 : 2:00 PM to 3:50 PM
Sponsor: Section on Survey Research Methods
Abstract - #304580
Title: Estimating Variance Components Using Random Forest
Author(s): Guillermo Mendez*+ and Sharon Lohr
Companies: American Express and Arizona State University
Address: Phoenix, AZ 85027
Keywords: variance components ; random forest ; mixed-effects ; data-mining
Abstract:

The random forest, a data mining technique that uses multiple classification or regression trees, is a popular algorithm for prediction. Inference and goodness-of-fit assessment, however, may require an estimator of variability. When a modified random forest algorithm is used to model mixed-effects data, an estimator of the vector of variance components is also required for prediction. We propose two estimators of the vector of variance components for random forest regression that take advantage of byproducts of the algorithm. The first estimator is based on the residual sum of squares from a random forest fit and uses a bootstrap bias correction. The second estimator is a difference-based estimator that uses proximity measures as weights. The estimators are evaluated through Monte Carlo simulations.
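
As a rough illustration of the general idea behind a difference-based estimator with proximity weights, the sketch below computes a weighted average of squared pairwise differences in the response. It is not the authors' estimator: it handles a single residual variance rather than a vector of variance components, and the function name `difference_based_variance` and the `proximity` argument are hypothetical. The proximity matrix is assumed to be supplied by the caller, for example as the fraction of trees in which two observations fall in the same terminal node.

```python
from itertools import combinations
from statistics import variance


def difference_based_variance(y, proximity):
    """Weighted difference-based estimate of a single residual variance.

    y         : list of numeric responses
    proximity : hypothetical n x n symmetric matrix (list of lists) of
                nonnegative pairwise weights, assumed to come from a
                fitted forest; it is NOT computed here.
    """
    weighted_sq_diff = 0.0
    total_weight = 0.0
    for i, j in combinations(range(len(y)), 2):
        w = proximity[i][j]
        weighted_sq_diff += w * (y[i] - y[j]) ** 2
        total_weight += w
    if total_weight == 0:
        raise ValueError("all pairwise weights are zero")
    # If y_i and y_j share the same mean, E[(y_i - y_j)^2] = 2 * sigma^2,
    # hence the factor of 2 in the denominator.
    return weighted_sq_diff / (2.0 * total_weight)


# Sanity check with made-up data: equal weights on every pair
# reduce the estimator to the ordinary sample variance.
y = [2.0, 4.0, 4.0, 7.0]
equal_weights = [[1.0] * len(y) for _ in y]
estimate = difference_based_variance(y, equal_weights)
```

With equal weights the result matches `statistics.variance(y)`. Concentrating the weights on pairs of observations that the forest treats as similar is what would let the pairwise differences cancel most of the mean function, under the assumption that high-proximity observations have nearly equal means.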


  • The address information is for authors who have a + after their name.
  • Authors who are presenting talks have a * after their name.



Revised September, 2008