All Times EDT

Abstract Details

Activity Number: 504 - Computational Challenges in Modern Statistical Inference
Type: Invited
Date/Time: Thursday, August 11, 2022 : 8:30 AM to 10:20 AM
Sponsor: IMS
Abstract #320342
Title: High-Dimensional Discriminant Analysis on Latent Variables
Author(s): Marten Wegkamp* and Xin Bing and Florentina Bunea
Companies: Cornell University and University of Toronto and Cornell University
Keywords: High-dimensional classification; latent factor model; principal component regression; dimension reduction; discriminant analysis
Abstract:

In high-dimensional classification problems, a commonly used approach is to first project the high-dimensional features onto a lower-dimensional space and then base the classification on the resulting lower-dimensional projections. We formulate a latent-variable model with a hidden low-dimensional structure to justify this two-step procedure and to guide the choice of projection. We propose a computationally efficient classifier that takes certain principal components (PCs) of the observed features as projections, with the number of retained PCs selected in a data-driven way. A general theory is established for analyzing such two-step classifiers based on any low-dimensional projections. We derive explicit rates of convergence of the excess risk of the proposed PC-based classifier and prove that these rates are minimax optimal. Our theory allows, but does not require, the lower dimension to grow with the sample size, and it remains valid when the feature dimension exceeds the sample size. Extensive simulations corroborate our theoretical findings. The proposed method also performs favorably relative to existing discriminant methods on three real data examples.
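
The two-step idea described above (project onto principal components, then classify on the projections) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes two classes labeled 0 and 1, uses ordinary linear discriminant analysis on the PC scores, and takes the number of retained PCs `k` as a user-supplied input, since the abstract does not state the data-driven selection rule. The function names `simulate`, `fit_pc_lda`, and `predict_pc_lda` and the simulated latent factor model are hypothetical, chosen only for illustration.

```python
import numpy as np


def simulate(rng, n, p, k):
    """Toy latent factor model (an assumption): X = Z A^T + noise, label-dependent mean of Z."""
    y = rng.integers(0, 2, size=n)
    mu = 1.5 * np.ones(k)
    # Latent variables: Gaussian around +mu for class 1 and -mu for class 0.
    Z = rng.normal(size=(n, k)) + np.where(y[:, None] == 1, mu, -mu)
    A = rng.normal(size=(p, k))            # loading matrix, p x k
    X = Z @ A.T + rng.normal(size=(n, p))  # observed high-dimensional features
    return X, y


def fit_pc_lda(X, y, k):
    """Step 1: project onto the top-k PCs. Step 2: two-class LDA on the scores."""
    mean = X.mean(axis=0)
    Xc = X - mean
    # Rows of Vt are the principal directions of the centered data.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    B = Vt[:k].T                           # p x k projection matrix
    scores = Xc @ B                        # n x k low-dimensional projections
    s0, s1 = scores[y == 0], scores[y == 1]
    m0, m1 = s0.mean(axis=0), s1.mean(axis=0)
    # Pooled within-class covariance of the projected features.
    S = ((s0 - m0).T @ (s0 - m0) + (s1 - m1).T @ (s1 - m1)) / (len(scores) - 2)
    w = np.linalg.solve(S, m1 - m0)        # LDA direction in the projected space
    prior1 = y.mean()
    b = -0.5 * w @ (m0 + m1) + np.log(prior1 / (1.0 - prior1))
    return mean, B, w, b


def predict_pc_lda(model, X):
    """Classify new rows of X with the fitted projection and linear rule."""
    mean, B, w, b = model
    return ((X - mean) @ B @ w + b > 0).astype(int)
```

In this sketch, one might call `simulate` once, fit on a first half with `fit_pc_lda(X_train, y_train, k)`, and evaluate `predict_pc_lda` on the second half. The projection is computed without using the labels, and the linear algebra in the second step involves only a `k` by `k` system, so the sketch runs even when the feature dimension exceeds the sample size.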


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program