Abstract #301863


The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2002 Program page



JSM 2002 Abstract #301863
Activity Number: 398
Type: Topic Contributed
Date/Time: Thursday, August 15, 2002 : 10:30 AM to 12:20 PM
Sponsor: Section on Physical & Engineering Sciences*
Abstract - #301863
Title: Discover Fully Dependent Patterns
Author(s): Sheng Ma*+ and Joseph Hellerstein and Feng Liang
Affiliation(s): IBM T. J. Watson Research Center and IBM T. J. Watson Research Center and Yale University
Address: 19 Skyline Drive, Hawthorne, New York, 10532, USA
Keywords: data mining ; statistical analysis ; pattern discovery ; dependent pattern
Abstract:

Pattern discovery is widely used to analyze market data. To date, the focus has been on frequent patterns. However, in applications such as detecting anomalies in computer networks and identifying security intrusions, frequent patterns characterize normal behavior, which is not of interest in these domains. Rather, the interest is in patterns that proceed malfunctions or other undesirable situations. Such patterns are characterized by items that co-occur with high probability, especially long, infrequent patterns (since these provide better predictive capabilities). Unfortunately, defining infrequent patterns in terms of the probability of item co-occurrence yields neither upward nor downward closure, and hence efficient algorithms cannot be constructed. Herein, we circumvent this problem by proposing fully dependent patterns (d-patterns), defined so that all subsets of a d-pattern are also d-patterns, a condition ensures downward closure. We develop a statistical test to qualify d-patterns, and construct an efficient algorithm for their discovery. We apply our algorithm to data from a real network and show that several patterns of interest are discovered.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2002 program

JSM 2002

For information, contact meetings@amstat.org or phone (703) 684-1221.

If you have questions about the Continuing Education program, please contact the Education Department.

Revised March 2002