Abstract:
|
When aggregating predictions with a voting rule, it is natural to ask "How many votes are needed to obtain a reliable prediction?" In the context of ensemble classifiers such as Random Forests, this question specifies a trade-off between computational cost and statistical performance. Namely, paying a larger computational price for more classifiers tends to reduce the prediction error of the ensemble and make it more stable. Conversely, by sacrificing some statistical efficiency, it is possible to speed up the tasks of training the ensemble and making new predictions. In this paper, we quantify this trade-off for the methods of Bagging and Random Forests, using a bootstrap-based approach. To be specific, let the random variable Err_t denote the prediction error of a randomly generated ensemble of t classifiers, trained on a fixed dataset. Then, as t tends to infinity, we show that the variance var(Err_t) can be consistently estimated via our proposed resampling method. As a consequence, this result offers practitioners a guideline for choosing the smallest number of base classifiers (e.g. decision trees) needed to ensure that var(Err_t) is less than a given value.
|
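The following is an illustrative sketch, not the paper's exact algorithm: given a pool of trained trees, it resamples subsets of t trees with replacement, recomputes the majority-vote error on a test set, and takes the empirical variance of those errors as an estimate of var(Err_t). All function names and the synthetic data are hypothetical, for demonstration only.

```python
# Hypothetical sketch of a bootstrap estimate of var(Err_t): resample t trees
# from a trained pool, majority-vote, and measure how the ensemble error varies.
import numpy as np

rng = np.random.default_rng(0)

def bootstrap_var_err(tree_preds, y, t, n_boot=200, rng=rng):
    """tree_preds: (T, n) array of 0/1 predictions from T trained trees.
    y: (n,) array of true 0/1 labels.
    Returns the sample variance of the error rate of a majority-vote
    ensemble of t trees, over n_boot bootstrap draws of trees."""
    T, n = tree_preds.shape
    errs = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, T, size=t)       # draw t trees with replacement
        votes = tree_preds[idx].mean(axis=0)   # fraction of trees voting class 1
        ens_pred = (votes > 0.5).astype(int)   # majority vote
        errs[b] = np.mean(ens_pred != y)       # ensemble error rate
    return errs.var(ddof=1)

# Toy demo: 100 synthetic "trees" that each predict correctly w.p. 0.7.
y = rng.integers(0, 2, size=500)
correct = rng.random((100, 500)) < 0.7
tree_preds = np.where(correct, y, 1 - y)

# The estimated variance of Err_t should shrink as the ensemble grows.
v_small = bootstrap_var_err(tree_preds, y, t=5)
v_large = bootstrap_var_err(tree_preds, y, t=50)
```

In this toy setup, v_small exceeds v_large, matching the abstract's premise that larger ensembles yield more stable error; a practitioner would increase t until the estimated var(Err_t) falls below a chosen threshold.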
Copyright © American Statistical Association.