Online Program Home
My Program

Abstract Details

Activity Number: 181 - SPEED: Statistical Learning and Data Science Speed Session 1, Part 2
Type: Contributed
Date/Time: Monday, July 29, 2019 : 10:30 AM to 11:15 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #307532
Title: Visual Diagnostics of a Model Explainer: Tools for the Assessment of LIME Explanations from Random Forests
Author(s): Katherine Goode* and Heike Hofmann
Companies: Iowa State University and Iowa State University
Keywords: random forest; LIME; black box model; interpretability ; diagnostics

Random forests are known for their accurate predictive abilities, but they are a part of the family of machine learning models that lack interpretability. A technique called LIME was developed to provide local interpretations for predictive models. The technique fits a ridge regression model to binary encoded perturbations created from the observed data weighted by proximity to the prediction of interest. The predicted values associated with the perturbations computed using the complex model are used as the response variable in the regression. The coefficients from the linear model are then used to determine the influential variables. We have developed a suite of visualizations and diagnostic tools to assess the explanations produced by LIME from a random forest. In particular, we consider how well the simple model fit by LIME captures the random forest model and how the results from LIME vary based on different algorithm input options. To demonstrate these tools, we apply LIME to a random forest fit to a forensics bullet matching dataset using the lime R package and apply our methods to diagnose the LIME explanations.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program