Online Program Home
My Program

Abstract Details

Activity Number: 600 - Less Can Be More: Smart Sampling in Data and Engineering Sciences
Type: Topic Contributed
Date/Time: Thursday, August 1, 2019 : 8:30 AM to 10:20 AM
Sponsor: Section on Physical and Engineering Sciences
Abstract #305086
Title: Support Points: An Optimal and Model-Free Method for Subsampling Big Data
Author(s): Roshan Vengazhiyil* and Simon Mak
Companies: Georgia Institute of Technology and Georgia Institute of Technology
Keywords: Experimental design; Quasi-Monte Carlo; Big Data

This talk presents a novel method called support points, which tackles the problem of optimal subsampling of big data that is independent of the statistical method used for modeling the data. This method has important applications to many practical problems in statistics and engineering, particularly when the available data is plentiful and high-dimensional, but the processing of such data is expensive due to computation or storage costs. We also propose an extension of the method called Projected Support Points to deal with high dimensional data, which ensures that the data is well-reduced on low-dimensional projections of the data space.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program