JSM 2015 Preliminary Program

Online Program Home
My Program

Abstract Details

Activity Number: 74
Type: Contributed
Date/Time: Sunday, August 9, 2015 : 4:00 PM to 5:50 PM
Sponsor: Section on Nonparametric Statistics
Abstract #314898 View Presentation
Title: Estimating the Number of Clusters Using Cross Validation
Author(s): Wei Fu* and Patrick Perry
Companies: and New York University
Keywords: clustering ; K-means ; cross validation ; nonparametric
Abstract:

Many clustering methods, including K-means, require the user to specify the number of clusters as an input parameter. A variety of methods have been devised to choose the number of clusters automatically, but they often rely on strong modeling assumptions. We propose a data-driven approach to estimate the number of clusters based on a novel form of cross-validation. This differs from ordinary cross-validation, because clustering is fundamentally an unsupervised learning problem. Simulation and real data analysis results show that our proposed method outperforms existing methods, especially in high-dimensional settings with heavy-tailed data


Authors who are presenting talks have a * after their name.

Back to the full JSM 2015 program





For program information, contact the JSM Registration Department or phone (888) 231-3473.

For Professional Development information, contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

2015 JSM Online Program Home