2015 Joint Statistical Meetings - Statistics: Making Better Decisions.

JSM 2015 Home

JSM 2015 Online Program

Abstract Details

Activity Number:	226
Type:	Invited
Date/Time:	Monday, August 10, 2015 : 2:00 PM to 3:50 PM
Sponsor:	International Indian Statistical Association
Abstract #314156	View Presentation
Title:	Asymptotic Theory for Random Forests
Author(s):	Stefan Wager*
Companies:	Stanford University
Keywords:
Abstract:	Random forests have proven themselves to be reliable predictive algorithms in many application areas. Not much is known, however, about the statistical properties of random forests. Several authors have established conditions under which their predictions are consistent, but these results do not provide practical estimates of the scale of random forest errors. In this paper, we analyze a random forest model based subsampling, and show that random forest predictions are asymptotically normal provided that the subsample size s scales as s(n)/n = o(log(n)^{-d}), where n is the number of training examples and d is the number of features. Moreover, we show that the asymptotic variance can consistently be estimated using an infinitesimal jackknife for bagged ensembles recently proposed by Efron (2013). In other words, our results let us both characterize and estimate the error-distribution of random forest predictions. Thus, random forests need not only be treated as black-box predictive algorithms, and can also be used for statistical inference.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2015 program

For program information, contact the JSM Registration Department or phone (888) 231-3473.

For Professional Development information, contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

2015 JSM Online Program Home