Online Program Home
My Program

Abstract Details

Activity Number: 84 - SPEED: A Mixture of Topics in Health, Computing, and Imaging
Type: Contributed
Date/Time: Sunday, July 29, 2018 : 4:00 PM to 4:45 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #332723
Title: Tailoring PCA for Detecting Sparse Changes in Multi-Stream Data
Author(s): Martin Tveten* and Ingrid Kristine Glad
Companies: University of Oslo and University of Oslo
Keywords: Sequential change-point detection; High-dimensional data streams; Sparse changes; Principal component analysis

Consider monitoring by a high-dimensional data stream, for instance 100-1000 sensors that regularly measure the condition of a large system. The aim is to detect changes to the mean or covariance matrix of this data stream as quickly as possible, while controlling the number of false alarms. Here, we give special attention to the realistic case where only a few of the componenets change; the change is sparse.

Our strategy for sequentially detecting such changes is based on projecting the incoming data onto a few of the principal axes of the pre-change data. But which principal axes are the most sensitive to changes of different type and sparsity? Based on the Hellinger distance, we show that it depends on the pre-change covariance matrix and which changes are of interest, but that the least varying axes in general are more sensitive. In two-dimensional data, this is explained theoretically, while simulations provide insight into higher dimensions. The proposed monitoring procedure automatically selects the most informative principal axes given a covariance matrix and relevant change scenarios, and we show that it is highly efficient in detecting such changes.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program