Abstract Details

Activity Number: 170 - Nonparametric Methods for Longitudinal and Survival Data
Type: Contributed
Date/Time: Monday, July 31, 2017 : 10:30 AM to 12:20 PM
Sponsor: Section on Nonparametric Statistics
Abstract #324442
Title: Boosted Multivariate Trees for Longitudinal Data
Author(s): Amol Pande* and Liang Li and Jeevanantham Rajeswaran and John Ehrlinger and Udaya Kogalur and Eugene Blackstone and Hemant Ishwaran
Companies: and University of Texas MD Anderson Cancer Center and Cleveland Clinic Foundation and Cleveland Clinic Foundation and Cleveland Clinic Foundation and Cleveland Clinic Foundation and University of Miami
Keywords: Gradient boosting ; Multivariate tree ; P-splines ; Marginal model ; Smoothing parameter

Machine learning methods provide a powerful approach for modeling a univariate response; however, their application to multivariate responses is limited. We use a machine learning approach to model longitudinal data, in which repeated measurements are observed for each subject over time. We use gradient boosting to boost multivariate trees to fit a novel, flexible semi-nonparametric marginal model for longitudinal data. In this model, features are modeled nonparametrically using multivariate trees, while feature-time interactions are modeled semi-nonparametrically using P-splines with an estimated smoothing parameter. To avoid overfitting, we describe a relatively simple in-sample cross-validation method that can be used to estimate the optimal boosting iteration and that has the surprising added benefit of stabilizing certain parameter estimates. Our new multivariate tree boosting method is shown to be highly flexible, robust to covariance misspecification and unbalanced designs, and resistant to overfitting in high dimensions. Feature selection is performed using variable importance to identify important features and feature-time interactions.
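
The idea of choosing the boosting iteration to avoid overfitting can be illustrated with a minimal sketch. It is not the authors' method: it uses plain squared-error gradient boosting with univariate regression stumps rather than multivariate trees and P-splines, and a subject-level hold-out set stands in for the in-sample cross-validation described in the abstract. The simulated data, the function names (`fit_stump`, `boost`, `simulate`), and all tuning values are assumptions made for illustration only.

```python
import random

def fit_stump(rows, resid):
    """Least-squares stump: returns (feature, split, left_value, right_value)."""
    n = len(rows)
    total = sum(resid)
    best_gain, best = -1.0, None
    for j in range(len(rows[0])):
        order = sorted(range(n), key=lambda i: rows[i][j])
        left_sum = 0.0
        for k in range(1, n):
            left_sum += resid[order[k - 1]]
            lo, hi = rows[order[k - 1]][j], rows[order[k]][j]
            if lo == hi:
                continue  # cannot split between tied values
            # Maximizing this quantity is equivalent to minimizing the split's SSE.
            gain = left_sum ** 2 / k + (total - left_sum) ** 2 / (n - k)
            if gain > best_gain:
                best_gain = gain
                best = (j, 0.5 * (lo + hi), left_sum / k, (total - left_sum) / (n - k))
    return best

def predict_stump(stump, row):
    j, split, left, right = stump
    return left if row[j] <= split else right

def mse(y, f):
    return sum((a - b) ** 2 for a, b in zip(y, f)) / len(y)

def boost(train_X, train_y, valid_X, valid_y, n_iter=200, nu=0.1):
    """Squared-error boosting; returns the initial fit, the held-out error path,
    and the iteration (1-based) with the smallest held-out error."""
    f0 = sum(train_y) / len(train_y)
    f_train = [f0] * len(train_y)
    f_valid = [f0] * len(valid_y)
    path = []
    for _ in range(n_iter):
        # Residuals are the negative gradient of squared-error loss (up to a constant).
        resid = [y - f for y, f in zip(train_y, f_train)]
        stump = fit_stump(train_X, resid)
        f_train = [f + nu * predict_stump(stump, r) for f, r in zip(f_train, train_X)]
        f_valid = [f + nu * predict_stump(stump, r) for f, r in zip(f_valid, valid_X)]
        path.append(mse(valid_y, f_valid))
    best_m = min(range(n_iter), key=path.__getitem__) + 1
    return f0, path, best_m

def simulate(n_subjects, rng):
    """Hypothetical unbalanced longitudinal data with a feature-time interaction."""
    X, y, ids = [], [], []
    for s in range(n_subjects):
        x = rng.random()
        b = rng.gauss(0, 0.3)  # subject-level random intercept
        times = sorted(rng.sample([0.0, 0.25, 0.5, 0.75, 1.0], rng.randint(2, 5)))
        for t in times:
            X.append([x, t])
            y.append(3 * x + 2 * x * t + b + rng.gauss(0, 0.3))
            ids.append(s)
    return X, y, ids

rng = random.Random(0)
X, y, ids = simulate(80, rng)
# Hold out whole subjects so repeated measurements never straddle the split.
train = [i for i, s in enumerate(ids) if s < 60]
valid = [i for i, s in enumerate(ids) if s >= 60]
train_X, train_y = [X[i] for i in train], [y[i] for i in train]
valid_X, valid_y = [X[i] for i in valid], [y[i] for i in valid]

f0, path, best_m = boost(train_X, train_y, valid_X, valid_y)
baseline = mse(valid_y, [f0] * len(valid_y))
print("selected boosting iteration:", best_m)
print("held-out MSE at that iteration:", path[best_m - 1], "vs. baseline:", baseline)
```

Splitting by subject rather than by row is a deliberate choice in this sketch: measurements from the same subject are correlated, so placing them on both sides of the split would make the held-out error look better than it should.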

Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

Copyright © American Statistical Association