Abstract:
|
NHANES 2009-16 collected saliva in a cross-sectional stratified multistage cluster probability survey of the US to study risk factors for oral human papillomavirus (HPV), which is strongly associated with incidence of oropharyngeal cancer (OPC). Three uses of this data and related statistical issues are: 1) identify risk factors for the prevalence of oral HPV infection; 2) examine effectiveness of prophylactic vaccination for HPV infection; and 3) develop/validate an absolute risk model of 1-year incident OPC in US with NHANES as a control series and non-representative clinical samples (i.e., nonprobability samples) of OPC as a case series. The statistical methods that address these data uses while accounting for the complex sample design of the NHANES are presented. The methods include predictive margins to estimate covariate/risk factor adjusted prevalence differences, cubic regression splines to assess nonlinear relationships between prevalence of oral HPV and age, quasi-score tests to test the impact of HPV vaccination on small HPV infection rates, and propensity weighting, cross-validation, and estimation of area under the ROC curve to evaluate the absolute risk models.
|