This is the program for the 2010 Joint Statistical Meetings in Vancouver, British Columbia.

Abstract Details

Activity Number: 356
Type: Contributed
Date/Time: Tuesday, August 3, 2010 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #307130
Title: Thinning Random Forests
Author(s): Jie Xu* and J. Brian Gray+
Companies: The University of Alabama and The University of Alabama
Address: Dept of Info Systems, Statistics, and Mgt Science, Tuscaloosa, AL, 35487-0226,
Keywords: cluster analysis ; ensemble model ; machine learning ; predictive modeling
Abstract:

Ensemble models, such as bagging, random forests, and boosting, have better predictive accuracy than single classifiers. However, an ensemble model typically consists of hundreds of single classifiers, which makes model interpretation difficult. In this presentation, we propose a new method based on cluster analysis for significantly reducing the size of an ensemble while maintaining roughly the same predictive accuracy. Model interpretation is improved, and computation time for future predictions is reduced. The method is demonstrated on several data sets and compared to other ensemble methods.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2010 program




2010 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.