Abstract:
|
In many biomedical applications, covariates are naturally grouped, with variables in the same group being systematically related or statistically correlated. In such settings, variable selection must be conducted at both the group level and the individual variable level. Motivated by the prevalence of zero-inflated count outcomes and grouped covariates in practical applications, we consider group regularization for zero-inflated regression models. Using a least squares approximation of the mixture likelihood and a variety of group-wise penalties on the coefficients, we propose a unified algorithm (Google: Group Regularization for Zero-inflated Count Regression Models) to efficiently compute the entire regularization path of the estimators. We investigate the finite sample performance of these methods through extensive simulation experiments and through the analysis of a German health care demand data set and an auto insurance claim data set from the SAS Enterprise Miner database. Finally, we derive theoretical properties of these methods under reasonable assumptions, which provide further insight into their asymptotic behavior.
|