JSM Preliminary Online Program
This is the preliminary program for the 2006 Joint Statistical Meetings in Seattle, Washington.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2006 Program page




Activity Number: 242
Type: Contributed
Date/Time: Tuesday, August 8, 2006 : 8:30 AM to 10:20 AM
Sponsor: Section on Physical and Engineering Sciences
Abstract - #306845
Title: Analysis of Handwritten ZIP Code Digits Using OBSTree
Author(s): Atina Dunlap Brooks*+ and Jacqueline Hughes-Oliver
Companies: North Carolina State University and North Carolina State University
Address: Department of Statistics, Raleigh, NC, 27695,
Keywords: data mining ; machine learning ; handwriting analysis ; prediction ; classification ; simulated annealing
Abstract:

The classification method OBSTree has been used to successfully analyze drug discovery datasets. One significant feature of these datasets is that they contain a small percentage of observations which have meaningful classes and a large number of uninteresting observations. This paper explores the algorithmic modifications necessary to perform an OBSTree analysis of a dataset in which there is no irrelevant class. The dataset used is a well known US postal zip code dataset which contains individual handwritten digits. Our results are compared to a variety of data mining methods, and the interpretability benefits of OBSTree are examined.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2006 program

JSM 2006 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised April, 2006