Abstract #301545


The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2002 Program page



JSM 2002 Abstract #301545
Activity Number: 398
Type: Topic Contributed
Date/Time: Thursday, August 15, 2002 : 10:30 AM to 12:20 PM
Sponsor: Section on Physical & Engineering Sciences*
Abstract - #301545
Title: Linear Regression Trees and Naive Bayes Trees
Author(s): Edwin Pednault*+
Affiliation(s): IBM T. J. Watson Research Center
Address: P.O. Box 218, Yorktown Heights, New York, 10598, USA
Keywords: Nonparametric models ; Classification and regression trees ; Treed regression ; Nonlinear models ; Logistic regression ; Naive Bayes
Abstract:

In real-world predictive modeling applications, segmentation-based modeling techniques are often employed wherein data records are partitioned into segments, and separate predictive models are developed for each segment. It is common practice to build models sequentially by first segmenting the data (using, for example, unsupervised clustering algorithms) and then developing predictive models for the segments. This approach, however, ignores the strong influence that segmentation exerts on the predictive accuracies of the segment models. It would be preferable to optimize the segmentation so as to maximize overall predictive accuracy. This talk will discuss the IBM ProbE (TM) predictive modeling system that accomplishes this optimization by combining decision tree techniques with statistical modeling performed at the leaves of trees. At present, ProbE is able to perform stepwise linear regression and stepwise naive Bayes modeling at the leaves as trees are being constructed. The models that are produced have been found to perform as well as or better than hand-crafted models in both credit-risk assessment and targeted marketing applications.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2002 program

JSM 2002

For information, contact meetings@amstat.org or phone (703) 684-1221.

If you have questions about the Continuing Education program, please contact the Education Department.

Revised March 2002