2013 Joint Statistical Meetings - Celebrating the International Year of Statistics

JSM 2013 Online Program

Online Program Home
My Program

Abstract Details

Activity Number:	245
Type:	Contributed
Date/Time:	Monday, August 5, 2013 : 2:00 PM to 3:50 PM
Sponsor:	WNAR
Abstract - #309233
Title:	The Superior Prediction Accuracy of the Random Generalized Linear Model Predictor (RandomGLM)
Author(s):	Lin Song*+ and Peter Langfelder and Steve Horvath
Companies:	University of California, Los Angeles and Genetics, UCLA and University of California, Los Angeles
Keywords:	RGLM ; machine learning ; ensemble predictor ; generalized linear model
Abstract:	Ensemble predictors such as the random forest are known to have superior accuracy but their black-box predictions are difficult to interpret. In contrast, a generalized linear model (GLM) is very interpretable especially when forward feature selection is used. However, forward selection tends to overfit the data and leads to low predictive accuracy. The random generalized linear model (RGLM) combines the advantages of ensemble predictors (high accuracy) with that of forward regression (interpretability). RGLM is a bootstrap aggregated GLM based predictor that incorporates several elements of randomness and instability: random subspace method, optional interaction terms and forward selection. Here we present comprehensive evaluations involving hundreds of genomic data sets and the UCI machine learning benchmark data. RGLM often outperforms alternative methods including random forests, support vector machines, and penalized regression models. RGLM provides variable importance measures that can be used to define a "thinned" ensemble predictor (involving few features) that retains excellent predictive accuracy. These methods are implemented in the R software package randomGLM.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2013 program

2013 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.