
Abstract Details

Activity Number: 7 - New Developments in Predictive Modeling of High-Dimensional Data
Type: Invited
Date/Time: Sunday, July 30, 2017 : 2:00 PM to 3:50 PM
Sponsor: Council of Chapters
Abstract #322121
Title: ROS Regression: Integrating Regularization with Optimal Scaling for Predictive Modeling of High-Dimensional Data
Author(s): Jacqueline J Meulman*
Companies: Leiden University
Keywords: Regularization ; Optimal Scaling ; Additive Models ; Monotonic Transformations ; Metabolomics
Abstract:

We combine two important extensions of ordinary least squares regression: regularization and optimal scaling. Within a single prediction framework, optimal scaling uses splines to transform continuous predictors and step functions to quantify categorical predictors. Both splines and step functions can be restricted to be monotonic, preserving the ordinal information in the data. In addition, they can be combined with regularization methods such as the Lasso and the Elastic Net. Predictor variables in high-dimensional data, for example in metabolomics, are usually highly correlated. We will show how optimal scaling can reduce a predictor's predictability from the other predictors, increasing its conditional independence and improving the conditioning of the correlation matrix as a whole, as measured by Log Determinant Divergence. We will discuss the interaction between regularization and optimal scaling and, finally, propose other options for regularizing the regression coefficients and the category quantifications/spline coefficients. Applications will be presented in the context of metabolomics. This is joint work with Anita Van der Kooij and Thomas Hankemeijer.
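
As a rough illustration of the conditioning measure named in the abstract, the sketch below assumes that Log Determinant Divergence is taken between the predictor correlation matrix R and the identity matrix I, that is D(R, I) = tr(R) - log det(R) - p, which reduces to -log det(R) because a correlation matrix has tr(R) = p. The abstract does not state the definition, so this choice, the function names, and the equicorrelated toy matrices are all assumptions made for illustration and not the authors' implementation.

```python
import math


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                d = a[i][i] - s
                if d <= 0.0:
                    raise ValueError("matrix is not positive definite")
                low[i][j] = math.sqrt(d)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def logdet_divergence_from_identity(r):
    """Assumed definition: D(R, I) = tr(R) - log det(R) - p."""
    p = len(r)
    factor = cholesky(r)
    # log det(R) = 2 * sum of the logs of the Cholesky diagonal
    log_det = 2.0 * sum(math.log(factor[i][i]) for i in range(p))
    trace = sum(r[i][i] for i in range(p))
    return trace - log_det - p


def equicorrelation(p, rho):
    """Hypothetical toy matrix: unit diagonal, constant correlation rho elsewhere."""
    return [[1.0 if i == j else rho for j in range(p)] for i in range(p)]


# Strongly correlated predictors versus weakly correlated predictors
strong = logdet_divergence_from_identity(equicorrelation(4, 0.9))
weak = logdet_divergence_from_identity(equicorrelation(4, 0.3))
print(strong > weak)
```

Under this assumed definition the divergence is zero for uncorrelated predictors and grows as the correlation matrix approaches singularity, so a transformation that lowers it moves the predictors toward mutual independence.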


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association