Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 336 - Le Cam Lecture
Type: Invited
Date/Time: Thursday, August 12, 2021 : 10:00 AM to 11:50 AM
Sponsor: IMS
Abstract #315533
Title: Understanding Spectral Embedding
Author(s): Jianqing Fan *
Companies: Princeton University
Keywords: Clustering; community detection; matrix pertubation; optimality
Abstract:

Spectral embedding has been widely used in statistics and machine learning. To provide fundamental understanding of its statistical properties, we develop an $\ell_p$ perturbation theory for a hollowed version of PCA in Hilbert spaces which provably improves upon the vanilla PCA in the presence of heteroscedastic noises. Through a novel $\ell_p$ analysis of eigenvectors, we investigate entrywise behaviors of principal component score vectors and show that they can be approximated by linear functionals of the Gram matrix in $\ell_p$ norm. For sub-Gaussian mixture models, the choice of $p$ in the theoretical analysis depends on the signal-to-noise ratio, which further yields optimality guarantees for spectral clustering. For contextual community detection, the $\ell_p$ theory leads to a simple spectral algorithm that achieves the information threshold for exact recovery. This provides optimal recovery results for the stochastic block model and Gaussian mixture model as special cases.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program