Online Program

Saturday, October 21
Sat, Oct 21, 7:30 AM - 8:30 AM
Aventine Ballroom G
Continental Breakfast and Speed Poster 4 sponsored by Bank of America

Integration of Sufficient Dimension Reduction and Boosting for Gene Expression Data (304095)

*Nusrat Jahan, James Madison University 

The statistical learning framework is a data-driven decision-making process. In general, it is an iterative process of class prediction for complex, high-dimensional data. Statistical learning approaches develop base prediction functions from subsets of the data (training data); the final predictive function is an aggregate of all the base functions and is independently validated by exposing it to new data (test data). These approaches do not depend on probabilistic distributional assumptions about the data and are very adept at handling data structures characterized by small sample size, high dimensionality, and inter-dependencies. Boosting is one of the statistical learning approaches that has been used in several studies for class prediction with gene expression data. In this work, we integrate a sparse sufficient dimension reduction technique with boosting to improve the accuracy of the class prediction function.
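
The workflow outlined in the abstract (reduce the predictors, boost base functions on training data, validate on test data) can be illustrated with a minimal sketch. The abstract does not say which sparse sufficient dimension reduction method or which boosting variant is used, so everything below is an assumption made for illustration: a simple marginal screening step stands in for the sparse reduction, AdaBoost with one-split decision stumps stands in for the boosting procedure, the data are synthetic, and all function names are hypothetical.

```python
import math
import random


def make_data(n, p, rng):
    """Synthetic stand-in for expression data: only features 0 and 1 carry signal."""
    X, y = [], []
    for i in range(n):
        label = 1 if i % 2 == 0 else -1
        row = [rng.gauss(0, 1) for _ in range(p)]
        row[0] += 2.0 * label
        row[1] += 2.0 * label
        X.append(row)
        y.append(label)
    return X, y


def screen_features(X, y, k):
    """Assumed stand-in for sparse dimension reduction: keep the k features
    whose class means differ most relative to their pooled spread."""
    scores = []
    for j in range(len(X[0])):
        a = [row[j] for row, lab in zip(X, y) if lab == 1]
        b = [row[j] for row, lab in zip(X, y) if lab == -1]
        mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
        ss = sum((v - mean_a) ** 2 for v in a) + sum((v - mean_b) ** 2 for v in b)
        scores.append(abs(mean_a - mean_b) / math.sqrt(ss / len(X) + 1e-12))
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]


def stump_predict(row, j, t, sign):
    """Base function: a one-split rule on feature j at threshold t."""
    return sign if row[j] > t else -sign


def fit_stump(X, y, w, features):
    """Pick the stump with the smallest weighted training error."""
    best = None
    for j in features:
        for t in sorted({row[j] for row in X}):
            for sign in (1, -1):
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if stump_predict(row, j, t, sign) != yi)
                if best is None or err < best[0]:
                    best = (err, j, t, sign)
    return best


def adaboost(X, y, features, rounds=10):
    """Aggregate weighted stumps; misclassified samples gain weight each round."""
    w = [1.0 / len(X)] * len(X)
    ensemble = []
    for _ in range(rounds):
        err, j, t, sign = fit_stump(X, y, w, features)
        err = min(max(err, 1e-10), 1 - 1e-10)  # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, j, t, sign))
        w = [wi * math.exp(-alpha * yi * stump_predict(row, j, t, sign))
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble


def predict(ensemble, row):
    """Final predictive function: sign of the weighted vote of all stumps."""
    vote = sum(a * stump_predict(row, j, t, s) for a, j, t, s in ensemble)
    return 1 if vote >= 0 else -1


rng = random.Random(0)
X_train, y_train = make_data(40, 50, rng)   # small n, larger p
X_test, y_test = make_data(20, 50, rng)     # held-out test data

selected = screen_features(X_train, y_train, k=5)
model = adaboost(X_train, y_train, selected, rounds=10)
accuracy = sum(predict(model, r) == t for r, t in zip(X_test, y_test)) / len(y_test)
print("selected features:", selected)
print("test accuracy:", accuracy)
```

The screening step uses only the training data, so the test set stays independent of both the reduction and the boosting stages. A genuine sufficient dimension reduction method would estimate linear combinations of predictors rather than pick individual ones; the screening here only mimics the sparsity aspect.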