Abstract:
|
Epidemiological studies involving biomarkers are often hindered by prohibitively expensive lab tests. Strategically pooling specimens prior to performing these assays has been shown to effectively reduce cost with minimal information loss in logistic regression settings. When the goal is to perform regression with a continuous biomarker as the outcome, regression analysis of pooled specimens may not be straightforward, particularly if the outcome is right-skewed. We demonstrate that a slight modification of a standard multiple linear regression model for poolwise data provides valid and precise coefficient estimates when pools are formed by combining biospecimens from subjects with identical covariate values. When these x-homogeneous pools cannot be formed, we propose an MCEM algorithm to compute MLEs. Simulations demonstrate that these analytical methods provide essentially unbiased estimates of coefficient parameters as well as their standard errors when appropriate assumptions are met. Furthermore, we show how one can utilize the fully observed covariate data to inform the pooling strategy, yielding a high level of statistical efficiency at a fraction of the total lab cost.
|
ASA Meetings Department
732 North Washington Street, Alexandria, VA 22314
(703) 684-1221 • meetings@amstat.org
Copyright © American Statistical Association.