Online Program Home
  My Program

Abstract Details

Activity Number:
Register
214603 - Introduction to Data Mining with CART Classification and Regression Trees (ADDED FEE)
Type: Professional Development
Date/Time: Wednesday, August 2, 2017 : 8:00 AM to 9:45 AM
Sponsor: ASA
Abstract #325497
Title: Introduction to Data Mining with CART Classification and Regression Trees (ADDED FEE)
Author(s): Dan Steinberg* and Mikhail Golovnya* and Charles Harrison*
Companies: Salford Systems and Salford Systems and Salford Systems
Keywords:
Abstract:

This Tutorial is intended for the applied statistician wanting to understand and apply the CART Classification and Regression Trees methodology. The emphasis will be on practical data analysis and data mining involving classification and regression. All concepts will be illustrated using real-world step-by-step examples. The course will begin with an intuitive introduction to tree-structured analysis: what it is, why it works, why it is non-parametric and model-free, advantages in handling of all types of data including missing values and categorical. Working through examples, we will review how to read the CART Tree output and how to set up basic analyses. This session will include performance evaluation of CART trees and will cover ways to search for possible improvements of the results. Once a basic working knowledge of CART has been mastered, the tutorial will focus on critical details essential for advanced CART applications including: choice of splitting criteria, choosing the best split, using prior probabilities to shape results, refining results with differential misclassification costs, the meaning of cross validation, tree growing and tree pruning. The course will conclude with some discussion of the comparative performance of CART versus other computer-intensive methods such as artificial neural networks and statistician-generated parametric models.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association