
All Times EDT

Abstract Details

Activity Number: 519 - Innovations in Time Series Modeling
Type: Contributed
Date/Time: Thursday, August 11, 2022 : 8:30 AM to 10:20 AM
Sponsor: Business and Economic Statistics Section
Abstract #322513
Title: K-ARMA Models for Clustering Time Series Data
Author(s): Derek O Hoare*
Companies: Cornell University
Keywords: Time Series; Clustering; Autoregression; ARIMA
Abstract:

We present an approach to clustering time series data using a model-based generalization of the K-means algorithm. We start with an AR(p) clustering example and show how the clustering algorithm can be made robust to outliers using a least absolute deviations criterion. We then build the algorithm up to cluster ARMA(p,q) models and extend it to ARIMA(p,d,q) models. We prove convergence of the algorithm and discuss model appropriateness for the fitted clusters using a generalization of the Ljung-Box test. We perform experiments with simulated data to show how the algorithm can be used for outlier detection and for detecting distributional drift, and we discuss the impact of the initialization method on empty clusters.
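
For readers unfamiliar with model-based K-means, the following is a minimal sketch of one plausible reading of the AR(p) case, not the author's implementation. It assumes zero-mean series, a least-squares (rather than least absolute deviations) criterion, and random initial labels; the function names `lagged_design`, `simulate_ar`, and `k_ar_cluster` are hypothetical. Each cluster is represented by one AR(p) coefficient vector, and the algorithm alternates between refitting each cluster's coefficients on its pooled members and reassigning every series to the cluster whose model predicts it best.

```python
import numpy as np


def lagged_design(x, p):
    """Return (X, y) with row t of X holding x[t-1], ..., x[t-p] and y = x[p:]."""
    n = len(x)
    X = np.column_stack([x[p - k - 1 : n - k - 1] for k in range(p)])
    return X, x[p:]


def simulate_ar(phi, n, rng, burn=100):
    """Simulate a zero-mean AR(p) series with standard normal innovations."""
    p = len(phi)
    x = np.zeros(n + burn)
    eps = rng.standard_normal(n + burn)
    for t in range(p, n + burn):
        x[t] = np.dot(phi, x[t - p : t][::-1]) + eps[t]
    return x[burn:]  # drop the burn-in so the start-up zeros do not matter


def k_ar_cluster(series, k, p, n_iter=50, seed=0):
    """Cluster series by AR(p) dynamics (assumed least-squares variant)."""
    rng = np.random.default_rng(seed)
    designs = [lagged_design(np.asarray(s, dtype=float), p) for s in series]
    labels = rng.integers(k, size=len(series))  # assumption: random initialization
    coefs = np.zeros((k, p))
    for _ in range(n_iter):
        # Update step: pooled least-squares AR fit over each cluster's members.
        for j in range(k):
            members = [d for d, lab in zip(designs, labels) if lab == j]
            if not members:
                continue  # empty cluster: keep its previous coefficients
            X = np.vstack([m[0] for m in members])
            y = np.concatenate([m[1] for m in members])
            coefs[j] = np.linalg.lstsq(X, y, rcond=None)[0]
        # Assignment step: move each series to the cluster with the smallest
        # sum of squared one-step prediction errors.
        losses = np.array(
            [[np.sum((y - X @ c) ** 2) for c in coefs] for X, y in designs]
        )
        new_labels = losses.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break  # no series changed cluster
        labels = new_labels
    return labels, coefs


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    data = [simulate_ar([0.8], 200, rng) for _ in range(10)]
    data += [simulate_ar([-0.8], 200, rng) for _ in range(10)]
    labels, coefs = k_ar_cluster(data, k=2, p=1)
    print(labels)
    print(coefs)
```

In this sketch neither step can increase the total squared prediction error, which is the usual argument for why K-means-style iterations terminate. The empty-cluster branch simply leaves the coefficients unchanged, which is only one of several possible conventions.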


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program