WSDS 2021 Online Program

All Times EDT

Friday, October 8

Fri, Oct 8, 2:45 PM - 4:00 PM
Virtual

This Is Statistics

Multiclass Regularized Regression Integrating Prior Information (309954)

David Conti, University of Southern California
*Jingxuan He, University of Southern California
Juan Pablo Lewinger, University of Southern California
Chubing Zeng, University of Southern California

Keywords: multi-classification, variable selection, prior information, regularized regression, empirical Bayes, relevant vector machine

Penalized regression is a common approach for feature selection. To enhance model prediction and interpretation, some methods exist to integrate prior data during the modeling process rather than post-hoc analysis for regression and binary classification. To this end, we developed an approach that implements prior-informed penalized regression for multi-classification problems. Specifically, regression coefficients are regularized by feature-specific penalty parameters which are modeled as a log-linear function of prior covariates. Penalty vectors are estimated by empirical Bayes method instead of cross-validation and a partial quadratic approximation is implemented for an analytical solution reducing the computational complexity for multiclass outcomes. The resulting marginal likelihood is optimized by a designed iterative reweighted-L2 algorithm. Through simulation studies and an applied example, we demonstrate our method's improved prediction accuracy, feature selection, and effect estimation compared with regular penalized models. We discuss the relationship to relevant vector machine and present extensions for grouped and ungrouped penalty vectors across multiple classes.

Online Program

Multiclass Regularized Regression Integrating Prior Information (309954)

American Statistical Association