Abstract #302034


The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2002 Program page



JSM 2002 Abstract #302034
Activity Number: 54
Type: Other
Date/Time: Monday, August 12, 2002 : 8:30 AM to 10:20 AM
Sponsor: ASA
Abstract - #302034
Title: Data Mining: Where Do We Start?
Author(s): Richard De Veaux*+
Affiliation(s): Williams College
Address: Bronfman Science Center, Williamstown, Massachusetts, 01267, USA
Keywords:
Abstract:

The first and seemingly simplest analytical step in data mining is to describe the data. But the standard exploratory data techniques of graphing and summarizing each variable take too long when dealing with hundreds of candidate predictors.

Moreover, data description alone cannot provide an action plan. You must build a predictive model based on patterns determined from known results, then test that model on results outside the original sample. In classical data analysis, the exploratory phase usually precedes the model selection phase. It's seen as a necessary preliminary for understanding the data before beginning to think about how to model it. But in data mining, sometimes we start with a preliminary model just to narrow down the set of potential predictors. This exploratory data modeling (EDM) seems to be at odds with standard statistical practice, but, in fact, it's simply using models as a new exploratory tool.

In this talk, we'll take a brief tour of the current state of data mining algorithms and using several case studies to explain how EDM can be used to narrow the search for a predictive model and to increase the chances of producing useful and meaningful results.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2002 program

JSM 2002

For information, contact meetings@amstat.org or phone (703) 684-1221.

If you have questions about the Continuing Education program, please contact the Education Department.

Revised March 2002