Abstract:
|
Multilevel regression with poststratification (MRP; MRP; e.g., Gelman, 2007; Ghitza and Gelman, 2013) and its extensions have made important contributions to addressing common problems in survey research, most notably frame error and sparse data. Despite its tremendous flexibility, researchers as yet have limited guidance on how studies can be designed to optimize MRP performance when they have at least some degree of control over the data collection.
The purpose of this paper is to understand MRP’s performance in scenarios common in public health, behavioral, and opinion research with highly clustered sparsity. A particular focus of this paper will be determining optimal sample sizes required along poststratification table margins; another will be quantifying the impact reasonable auxiliary variables have on data requirement. Results will be based on a comprehensive and realistic empirical simulation study derived from an application of MRP to understanding family migration at the county level in the United States.
|