Online Program Home
My Program

Abstract Details

Activity Number: 345 - Theory and Methods for Multivariate Analysis
Type: Contributed
Date/Time: Tuesday, July 31, 2018 : 10:30 AM to 12:20 PM
Sponsor: IMS
Abstract #329167 Presentation
Title: How to Select the Number of Components in PCA and Factor Analysis? Understanding and Improving Permutation Methods
Author(s): Edgar Dobriban* and Art Owen
Companies: and Stanford University
Keywords: PCA; factor analysis; parallel analysis; random matrix theory; permutation methods; big data
Abstract:

Selecting the number of components in PCA and factor analysis is a key problem facing practitioners. One of the most popular methods is a permutation approach that randomly scrambles the elements of each feature. It selects the components whose singular values are large compared to the permuted data. This method (also known as parallel analysis) is recommended in many textbooks and review papers, and used in genomics by leading applied statisticians including T Hastie, M Stephens, J Storey, R Tibshirani and WH Wong. However, it is poorly understood. In this talk, we develop a theoretical understanding and propose improvements.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program