JSM Preliminary Online Program
This is the preliminary program for the 2007 Joint Statistical Meetings in Salt Lake City, Utah.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.



Back to main JSM 2007 Program page




Activity Number: 251
Type: Contributed
Date/Time: Tuesday, July 31, 2007 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Computing
Abstract - #309886
Title: Feature Selection for Large Data
Author(s): Peng Liu*+ and Jiayang Sun
Companies: Case Western Reserve University and Case Western Reserve University
Address: 26241 Lake Shore Blvd, Euclid, OH, 44132,
Keywords: data mining ; large data ; feature selection ; mixture distribution ; partial EM ; intrusion detection
Abstract:

A typical challenge in data mining is that data can be too large to be loaded into a computer program once and for all for an analysis, or data come sequentially in streams, so it is necessary to work on pieces of the data and then combine the information from different pieces to obtain the whole picture. In this talk we describe our recent research in developing techniques for feature selection and mixture estimation for large data. Their performance is evaluated by asymptotic analysis and simulation, and compared with standard algorithms. The application of our proposed methods in intrusion detection is demonstrated on the KDD Cup 1999 dataset. (Part of the talk is based on joint work with J. Chen and Z. Zhang.)


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2007 program

JSM 2007 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised September, 2007