|
Activity Number:
|
500
|
|
Type:
|
Contributed
|
|
Date/Time:
|
Thursday, August 10, 2006 : 8:30 AM to 10:20 AM
|
|
Sponsor:
|
Section on Survey Research Methods
|
| Abstract - #306864 |
|
Title:
|
Creating Imputation Classes Using Nonparametric Classification Trees
|
|
Author(s):
|
Darryl Creel*+ and Stephen Black and Karol Krotki and Jeremy Porter
|
|
Companies:
|
RTI International and RTI International and RTI International and RTI International
|
|
Address:
|
312 Trotter Farm Drive, Rockville, MD, 20850,
|
|
Keywords:
|
nonresponse ; imputation ; CHAID ; CART
|
|
Abstract:
|
Virtually all surveys encounter some level of item nonresponse. To address this potential source of bias, practitioners often use imputation to replace missing values with valid values through some form of stochastic modeling. In order to improve the reliabilities of such models, imputation classes are formed to produce homogenous groups of respondents, where homogeneity is measured with respect to the item that is being imputed. A common method used to form imputation classes is CHAID where the splitting rule is based on Chi-squared tests. This paper examines an alternative methodology used to form imputation classes, nonparametric classification trees where the splitting rules are based on the Gini index of impurity. In addition to a brief description of the nonparametric classification tree methodology, comparative examples are provided.
|