JSM 2004 - Toronto

Abstract #301080

This is the preliminary program for the 2004 Joint Statistical Meetings in Toronto, Canada. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2004); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2004 Program page



Activity Number: 425
Type: Contributed
Date/Time: Thursday, August 12, 2004 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Abstract - #301080
Title: A Statistical Foundation for Association Rules Based on Clustering
Author(s): Carlos Ordonez*+
Companies: Teradata, NCR
Address: 17095 Via del Campo, San Diego, CA, 92127,
Keywords: association rules ; binary data ; clustering ; support bounds
Abstract:

Association rules are a data-mining technique for which there exist efficient algorithms, but that as of today lack an adequate statistical foundation. Therefore, we propose a statistical model for association rules. This approach is instrumental in gaining a deeper understanding of association rules based on a simple, yet enlightening, statistical model. A dataset with binary variables is modeled as a set of clusters. Such binary data clusters are used to get tight lower and upper bounds on association support. Clusters are also used to get lower and upper bounds on association rule confidence. Then bound averages are used to estimate support and confidence. The model estimates show asymptotic increasing accuracy as the number of clusters increases. We discuss our progress on proving theoretical guarantees provided by the model.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2004 program

JSM 2004 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2004