Online Program Home
  My Program

Abstract Details

Activity Number: 112 - Methods for Imputing Missing Survey Data
Type: Topic Contributed
Date/Time: Monday, July 31, 2017 : 8:30 AM to 10:20 AM
Sponsor: Survey Research Methods Section
Abstract #323719 View Presentation
Title: Hot Deck Imputation of Multinomial Distributions When There Are Fewer Donors Than Recipients
Author(s): Katherine Thompson* and Rebecca Andridge and Laura Bechtel and Natasha McCarthy
Companies: U.S. Census Bureau and Ohio State University and U.S. Census Bureau and U.S. Census Bureau
Keywords: nearest neighbor ; hot deck imputation ; multinomial distribution

Detailed breakdown of totals items are collected in surveys. Detail proportions can vary greatly by sample unit, and the multinomial distributions can likewise vary by imputation cell. Consequently, although it might be feasible to develop viable parametric imputation models for the total, it is challenging for the collective set of detail items. Instead, a common practice is to use some form of hot deck imputation to match donor and recipient records, then impute the donor's complete set of proportions. Nearest neighbor imputation is useful when the set of proportions is correlated with unit size. This approach preserves the correlation between the detailed items within imputation cell, as long as the number of donors is greater than or equal to the number of recipients. Unfortunately, this condition often does not hold in practice. Collapsing imputation cells is not an attractive alternative. We explore unrestricted usage of the donor records in the original cell versus the usage of a random draw from the donor record's multinomial distribution via a limited simulation study using historic data in selected industries from the 2012 Economic Census.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

Copyright © American Statistical Association