Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 215 - Contributed Poster Presentations: Section on Statistical Learning and Data Science
Type: Contributed
Date/Time: Tuesday, August 4, 2020 : 10:00 AM to 2:00 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #314102
Title: Assessment of Data Reduction Models Including Autoencoders for Optimal Visualization, Interpretability and Speed
Author(s): Benedict Anchang*
Companies: NIEHS
Keywords: data reduction; model interpretability; visualization; auto encoders; single-cell

Recent technological advances for analyzing single cells motivate similar improvements in computational, visualization and interpretability tools. The increasing generated data sets with hundreds of thousands of data points are hard to interpret due to their high dimensionality. With the renewed interest in innovative dimensionality reduction methods, there is an urgent need to robustly assess the performance of these methods. Some methods are powerful at keeping the intrinsic structure of the data but are time costly and generally lack a close form that allows the model to be reused on new data points for explicit low dimensional embedding. Others are not visually interpretable but time efficient. We define a multivariate metric that can be used for assessing the quality of projection in terms of running time efficiency, fidelity and close form representation of the data structure in addition to quality of interpretability and visualization in terms of coverage and spread. We further motivate the use of Autoencoders, a growing category of Neural Networks that allows for optimal data visualization, corrupt data detection and data correction while outperforming matrix factorization

Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program