Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 75 - Invited EPoster Session II
Type: Invited
Date/Time: Sunday, August 7, 2022 : 9:35 PM to 10:30 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #323515
Title: Online Data Selection and Sparse Estimation for Multivariate Streaming Data
Author(s): Rui Xie*
Companies: University of Central Florida
Keywords: Sampling ; Streaming Data; Sparse regression; Online analysis
Abstract:

Real-time analysis of large-scale streaming multivariate data often faces a trade-off between statistical estimation efficiency and computational cost efficiency. For multivariate data streams, one needs to carefully balance the trade-off, especially for sparse and possibly under-determined regression problems, which requires more computational efforts. Data selection enables one to process large-scale streaming data in real-time, so one can fit and update the sparse model in seconds instead of hours. We study the online real-time joint data-dependent sample selection and continuous variable selection for a multi-dimensional spare regression problem for streaming data. We propose a class of online data selection methods that achieve simultaneously sampling and sparse estimation to improve the computational efficiency of the online analysis. The online sparse model estimation involves using coordinate descent algorithms for nonconvex penalized regression, and the real-time data selection adapts optimal design-based sequential online sampling. The performance of the sampling-assisted online sparse estimation method is assessed via simulation studies and real data examples.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program