Abstract #301152


The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2002 Program page



JSM 2002 Abstract #301152
Activity Number: 52
Type: Contributed
Date/Time: Sunday, August 11, 2002 : 4:00 PM to 5:50 PM
Sponsor: Section on Statistical Computing*
Abstract - #301152
Title: Low-Storage, Sequential, Simultaneous Estimation of Multiple Quantiles for Massive Datasets
Author(s): James McDermott*+ and John Liechty and Dennis Lin
Affiliation(s): Pennsylvania State University and Pennsylvania State University and Pennsylvania State University
Address: 325 Thomas Bldg, University Park, Pennsylvania, 16802, USA
Keywords: quantile estimation ; sequential methods ; massive datasets ; low-storage ; datamining ; data mining
Abstract:

We propose a low-storage, single-pass, sequential method for simultaneous estimation of multiple quantiles for massive datasets. The proposed method uses estimated ranks, assigned weights, and a scoring function that determines the most attractive candidate data points for estimates of the quantiles. The method uses a small fixed amount of storage and its computation time is O(n). Asymptotically, the proposed estimates are as accurate as the sample quantiles. We compare the proposed method's performance with that of the empirical distribution function through simulation study.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2002 program

JSM 2002

For information, contact meetings@amstat.org or phone (703) 684-1221.

If you have questions about the Continuing Education program, please contact the Education Department.

Revised March 2002