Name: 2020 Joint Statistical Meetings
Start: 2020-08-02T07:00:00+00:00
End: 2020-08-06

Online Program Home
My Program

All Times EDT

Abstract Details

Activity Number:	498 - Modern Machine Learning
Type:	Contributed
Date/Time:	Thursday, August 6, 2020 : 10:00 AM to 2:00 PM
Sponsor:	Section on Statistical Learning and Data Science
Abstract #312435
Title:	Machine Learning Oracle to Guide Statistical Data Processing
Author(s):	Lucas Koepke* and Michael Frey
Companies:	National Institute of Standards and Techology and National Institute of Standards and Technology
Keywords:	machine learning; isotonic regression; pool-adjacent-violators algorithm; change-point
Abstract:	We propose a hybrid framework that leverages machine learning (ML) techniques to make decisions about data processing steps preparatory to a formal statistical inference. We show that, by resampling from the data at hand, a ML algorithm can be trained that is tailored to the specific setting of the statistical analysis and that offers informed recommendations to guide the course of that analysis. Monte Carlo experiments show this method’s effectiveness and allow us to explore its degree of effectiveness depending on the classifier’s architecture. We finish with an application to change-point estimation, using data on fatigue crack growth in additively manufactured titanium. Crack growth rate never decreases under increased stress, but is classified into distinct regimes to determine material properties. Isotonic regression is thus applicable, due to the monotonic structure, but may not benefit the change-point estimation. For this example we use a ML classifier tailored to the available data to decide whether to apply isotonic regression preparatory to estimating the change-point separating the different crack growth regimes.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program

JSM 2020 Online Program

Abstract Details

American Statistical Association