Abstract:
|
Understanding the prevalence of infections in the population of interest is critical for making data-driven public health responses to infectious disease outbreaks. Accurate prevalence estimates, however, can be difficult to calculate due to a combination of low population prevalence, imperfect diagnostic tests, and limited testing resources. In addition, strategies based on convenience samples that target only symptomatic or high-risk individuals will yield biased estimates of the population prevalence. We present Bayesian multilevel regression and poststratification models that integrate probability sampling designs, the sensitivity and specificity of a diagnostic test, specimen pooling, and potential selective response to obtain unbiased prevalence estimates. These models easily incorporate all available prior information and can yield reasonable inferences even with very low base rates of infection and limited testing resources. We examine the performance of these models with an extensive numerical study that varies the sampling design, sample size, response rate, true prevalence, and pool size. We demonstrate the relative robustness of the models via sensitivity analyses.
|