JSM 2011 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Abstract Details

Activity Number: 585
Type: Contributed
Date/Time: Wednesday, August 3, 2011 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #302882
Title: Data Mining Categorical Predictors with Missing Values
Author(s): Hua Fang*+ and Honggang Wang
Companies: University of Massachusetts and University of Massachusetts at Dartmouth
Address: Medical School, , ,
Keywords: data mining ; categorical predictor ; missing values ; high-dimensional ; multiple-imputation ; simulation
Abstract:

Predictors with categories of no clear boundaries are prevalent in all kinds of studies, for example, low- or high- cancer/behavioral risk people, better or worse consulting procedure, contrarian or non-contrarian investors, efficient or inefficient market, etc.. A number of attributes can be used to better describe this type of categorical predictors, accordingly reduce their measurement errors and increase their predictive power in statistical hypothesis testing. To accommodate missing values in the attributes, we proposed multiple-imputation based data mining techniques to characterize categorical predictors. Theoretical illustration, simulation and cases studies will be included to demonstrate the utility of our proposed method.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2011 program




2011 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.