JSM 2011 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Abstract Details

Activity Number: 236
Type: Contributed
Date/Time: Monday, August 1, 2011 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Computing
Abstract - #302467
Title: Determining Fitness Function Parameters for GA-Boost
Author(s): Dong-Yop Oh*+ and J. Brian Gray
Companies: University of Alabama and University of Alabama
Address: ISM Dept, 300 Alston Hall, Tuscaloosa, AL, 35487-0226,
Keywords: AdaBoost ; classification ; genetic algorithm ; predictive model ; weak classifier
Abstract:

Our recently proposed genetic boosting algorithm, GA-Boost, directly solves for the weak classifiers in an ensemble and their weights using a genetic algorithm. The fitness function consists of three parameters (a, b, and p) that limit the number of weak classifiers (by b) and control the effects of outliers (by a) to maximize an appropriately chosen p-th percentile of margins. We use several artificial data sets to compare GA-Boost performance at 16 different treatment levels, as well as how it compares to AdaBoost, at four different noise levels. Through these simulations, we verify that GA-Boost has better performance with simpler predictive models than AdaBoost when there is a large proportion of outliers in a data set. GA-Boost is applied to real data sets with three different weak classifier options and compared to other robust boosting methods. We also consider graphical methods for selecting the value of p.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2011 program




2011 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.