Online Program

Friday, May 18
Computing Science
Distinguished Colleagues of Edward Wegman: Applications to Data Science
Fri, May 18, 10:30 AM - 12:00 PM
Grand Ballroom D
 

Cherry-Picking for Complex Datasets (304551)

Presentation

*David Banks, SAMSI and Duke University 

Keywords: Many regressions, cluster analysis, multidimensional scaling, mixture models

Many data sets, especially Big Data sets, are complex in the sense that the observations are generated by many different mechanisms. This talk describes a procedure that iteratively extracts simpler component structures from complex data. The ideas are illustrated in the context of regression, cluster analysis, and multidimensional scaling. Portions of the work are based on ideas implicit in earlier research by Ed Wegman.