Online Program Home
  My Program

Abstract Details

Activity Number: 655 - Statistical Analysis of Naturalistic Driving Study Data: How You Slice and Dice Matters
Type: Topic Contributed
Date/Time: Thursday, August 3, 2017 : 10:30 AM to 12:20 PM
Sponsor: Transportation Statistics Interest Group
Abstract #322893 View Presentation
Title: Using Lasso Regression for the Selection of Matching Variables to Analyze Crash Risk in SHRP 2 Data for a Case-Control Setting
Author(s): Huizhong Guo* and Linda Ng Boyle
Companies: and University of Washington
Keywords: case control ; Lasso regression ; variable selection ; driver behavior ; crash risk
Abstract:

Case-control methods are used to examine safe driving from naturalistic studies, where exposure to a secondary task are compared between driving episodes with and without a crash. Matched case-control approaches are favored over independent approaches because they control for confounding factors. By matching cases and controls using identified confounding factors (namely matching variables), it provides a more robust estimate of the outcome of interest. We proposed a data-based approach of matching variable selection using the Lasso (least absolute shrinkage and selection operator) regression method. We considered variables related to the driver (demographics, driving history, and perceptions), vehicle (type, age, power) and environmental conditions. A driver-level and an epoch-level model was developed. At the driver level, the number of violations and the study site were ranked highest. At the epoch level, intersection, pre-incident maneuver and traffic control were the top ranked based on the AUC (Area Under Curve) of a ROC curve. These are used as the matching variables for developing the conditional logistic regression model to assess crash risk.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association