JSM 2004 - Toronto

Abstract #301433

This is the preliminary program for the 2004 Joint Statistical Meetings in Toronto, Canada. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2004); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2004 Program page



Activity Number: 187
Type: Contributed
Date/Time: Tuesday, August 10, 2004 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Computing
Abstract - #301433
Title: Estimating Intrinsic Dimensionality from the Nearest-neighbor Distances
Author(s): Elizaveta Levina*+
Companies: University of Michigan
Address: Dept. of Statistics, 439 West Hall, Ann Arbor, MI, 48109-1092,
Keywords: dimensionality reduction ; intrinsic dimension ; manifold projections ; nearest-neighbor distances
Abstract:

Recently developed nonlinear manifold projection methods such as Locally Linear Embedding and Isomap have become popular dimensionality reduction tools. Using these methods successfully for applications such as classification requires estimating intrinsic data dimension. The existing methods for intrinsic dimension estimation are mostly heuristic and very little is known about how they scale with the number of observations and the true dimension. We propose a new method for estimating intrinsic dimensionality, derived through a rigorous analysis of distributions of the nearest-neighbor distances. A simple closed-form maximum likelihood estimator of dimension is obtained by treating observations in a small sphere as a homogeneous Poisson process. We apply the method to some of the popular manifold datasets and study its behavior as the dimension and the sample size grow, analytically and through simulations. This is joint work with Peter Bickel.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2004 program

JSM 2004 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2004