The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.
Abstract Details
Activity Number:
|
585
|
Type:
|
Contributed
|
Date/Time:
|
Wednesday, August 3, 2011 : 2:00 PM to 3:50 PM
|
Sponsor:
|
Section on Statistical Learning and Data Mining
|
Abstract - #302882 |
Title:
|
Data Mining Categorical Predictors with Missing Values
|
Author(s):
|
Hua Fang*+ and Honggang Wang
|
Companies:
|
University of Massachusetts and University of Massachusetts at Dartmouth
|
Address:
|
Medical School, , ,
|
Keywords:
|
data mining ;
categorical predictor ;
missing values ;
high-dimensional ;
multiple-imputation ;
simulation
|
Abstract:
|
Predictors with categories of no clear boundaries are prevalent in all kinds of studies, for example, low- or high- cancer/behavioral risk people, better or worse consulting procedure, contrarian or non-contrarian investors, efficient or inefficient market, etc.. A number of attributes can be used to better describe this type of categorical predictors, accordingly reduce their measurement errors and increase their predictive power in statistical hypothesis testing. To accommodate missing values in the attributes, we proposed multiple-imputation based data mining techniques to characterize categorical predictors. Theoretical illustration, simulation and cases studies will be included to demonstrate the utility of our proposed method.
|
The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.
Back to the full JSM 2011 program
|
2011 JSM Online Program Home
For information, contact jsm@amstat.org or phone (888) 231-3473.
If you have questions about the Continuing Education program, please contact the Education Department.