Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 185 - The Missing Puzzle Piece: Estimating Survey Data Collection Costs
Type: Invited
Date/Time: Tuesday, August 10, 2021 : 1:30 PM to 3:20 PM
Sponsor: Survey Research Methods Section
Abstract #316570
Title: Using Machine Learning and Statistical Models to Predict Survey Costs
Author(s): James Wagner* and Brady T. West and Michael R. Elliott and Stephanie Coffey
Companies: University of Michigan and University of Michigan and University of Michigan and U.S. Census Bureau
Keywords: Responsive survey design; Nonresponse error; Survey costs
Abstract:

Responsive survey designs implement surveys in phases, where each phase is a separate protocol with different cost and error structures. The goal is to design a series of phases such that nonresponse errors cancel each other across the phases while staying within a fixed budget. Some work has been done to identify when phases are complementary with respect to errors. However, no work has been done to evaluate costs across phases. Without accurate cost estimates, resources may be inefficiently allocated across phases. In this presentation, we compare statistical and machine learning methods of predicting costs for alternative designs. The first modeling strategy uses multi-level models to predict the number of hours of interviewer time. The second approach uses a machine learning method, Bayesian Additive Regression Trees (BART). We evaluate the predictive accuracy of the models using data from a real survey. We find that the BART modeling approach yields a useful approach to maximizing predictive accuracy, while the multi-level regression models offer an alternative with results that are relatively easy to interpret.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program