Abstract #300924

This is the preliminary program for the 2003 Joint Statistical Meetings in San Francisco, California. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 2-5, 2003); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2003 Program page



JSM 2003 Abstract #300924
Activity Number: 368
Type: Contributed
Date/Time: Wednesday, August 6, 2003 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Abstract - #300924
Title: TARGET: Tree Analysis with Randomly Generated and Evolved Trees
Author(s): J. Brian Gray*+ and Guangzhe Fan
Companies: University of Alabama and University of Alabama
Address: Dept. of IS, Statistics, and Mgt Science, Tuscaloosa, AL, 35487-0001,
Keywords: genetic algorithm ; prediction ; classification ; regression ; bagging ; CART
Abstract:

Tree-structured modeling is a valuable tool for predictive modeling and data mining. However, traditional tree-growing methodologies, such as CART, are known to suffer from "greediness," i.e., local optimizations at nodes to be split in a decision tree do not always result in global optimization of the tree model. In addition, CART solutions are known to be sensitive to perturbations in the data and can vary greatly across different training sets sampled from the same data. Bagging and other ensemble techniques have improved on the predictive performance of CART, but they do not have the ease of interpretation of traditional tree models. We introduce a method of tree analysis based on randomly generated and evolved trees, known as TARGET. Empirical evidence suggests that (1) TARGET produces smaller trees than CART, (2) TARGET has better predictive performance than CART and the same or better predictive performance than bagging, (3) TARGET is more stable than CART across different training samples from the same data, and (4) the training and test misclassification rates of TARGET solutions are in closer agreement than those for CART, indicating less data overfitting.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2003 program

JSM 2003 For information, contact meetings@amstat.org or phone (703) 684-1221. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2003