Abstract:
|
This paper examines nonresponse in a longitudinal establishment survey by using regression trees to model the effect of time in survey on data collection response propensities. We built regression trees using the R Package "Recursive Partitioning for Modeling Survey Data (RPMS)," which fits a linear model to the data conditionally on variables selected through recursive partitioning. The RPSM tree model provides estimates for the average response rate (across time), the intercept (initial response rate), and the slope (the effect of time in survey) for each end node. Using the RPSM package, we modeled the effect of time in survey on response rates in the Job Openings and Labor Turnover Survey (JOLTS) of the Bureau of Labor Statistics, so we could identify establishment characteristics and subgroups of establishments that were least likely to respond throughout the data collection process and thus determine which establishments required more follow-up over the course of the survey to ensure that they continue responding.
|