Abstract #300484

This is the preliminary program for the 2003 Joint Statistical Meetings in San Francisco, California. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 2-5, 2003); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2003 Program page



JSM 2003 Abstract #300484
Activity Number: 116
Type: Topic Contributed
Date/Time: Monday, August 4, 2003 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Abstract - #300484
Title: Very Fast Multivariate Kernel Density Estimation via Computational Geometry
Author(s): Alexander G. Gray*+ and Andrew Moore
Companies: Carnegie Mellon University and Carnegie Mellon University
Address: Computer Science Department, Pittsburgh, PA, 15213,
Keywords: computation ; kernel density estimation ; nonparametric ; density estimation ; algorithm ; large datasets
Abstract:

To date, the use of nonparametric methods has been limited by their computational intractibility for all but the smallest datasets, particularly when cross-validation procedures are accounted for. By introducing powerful techniques from computational geometry to this problem, we present an algorithm for multivariate kernel density estimation which reduces the cost of estimating the density at a query point from O(N) to O(1), or constant time complexity, i.e., its cost does not depend on the dataset size. Furthermore, unlike preceding work, it does not scale exponentially in the dimension D. Empirically and analytically we show that the method achieves dramatic efficiencies superior to those of previous algorithmic approaches including binning methods and the FFT, in terms of both dataset size and dimensonality. Additionally, the algorithm guarantees accuracy within any tolerance specified by the user, has no indirect parameters (such as a number of grid points) to tune, naturally admits estimation at arbitrary query points, and works for all common kernel choices interchangeably.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2003 program

JSM 2003 For information, contact meetings@amstat.org or phone (703) 684-1221. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2003