Abstract:
|
Dimension reduction and visualization is a staple of data analytics. Methods such as Principal Component Analysis (PCA) and Multidimensional Scaling (MDS) provide low dimensional (LD) projections of high dimensional (HD) data while preserving an HD relationship between observations. Traditional bi-plots assign meaning to the LD space of a PCA projection by displaying LD axes for the attributes. These axes, however, are specific to the linear projection used in PCA. MDS projections, which allow for arbitrary stress and dissimilarity functions, require special care when labeling the LD space. We propose an iterative scheme to plot an LD axis for each attribute based on the user-specified stress and dissimilarity metrics. In this talk, we will discuss the details of our general bi-plot methodology, its relationship with PCA-derived bi-plots, and provide some illuminating examples based on both simulated and real data.
|