JSM 2016 Online Program

Activity Number:	560
Type:	Contributed
Date/Time:	Wednesday, August 3, 2016 : 10:30 AM to 11:15 AM
Sponsor:	Section on Statistics in Genomics and Genetics
Abstract #321769
Title:	Variable Selection in Untargeted Metabolomics Data Analysis
Author(s):	Alexander Kirpich* and Matthew Merritt and George Michailidis and Lauren McIntyre
Companies:	University of Florida and University of Florida and University of Florida and University of Florida
Keywords:	metabolomics ; data processing ; variable selection
Abstract:	Genomic data variable selection problems center around the issue of large p and small n. Metabolomics shares the same data structure, but possesses extra challenges. The main challenges in metabolomics data processing are the uncertainty caused by peak identification, uncertainty in relating each peak to a specific compound of interest, lack of consensus on the best way to estimate quantities when the assay is untargeted (e.g. peak area vs peak height). Additionally, lack of independence caused by adducts and isomers and heterogeneity of the variances across metabolites caused by metabolites correlation structure and unequal sample weights. The performance of variable selection methods in these complex circumstances is explored. In this work we discuss those challenges in details and provide the examples of the results for the different approaches.

Authors who are presenting talks have a * after their name.