JSM Preliminary Online Program
This is the preliminary program for the 2009 Joint Statistical Meetings in Washington, DC.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2009 Program page




Activity Number: 538
Type: Invited
Date/Time: Thursday, August 6, 2009 : 8:30 AM to 10:20 AM
Sponsor: Section on Physical and Engineering Sciences
Abstract - #303184
Title: A Co-Training Algorithm for Multiview Data with Applications in Data Fusion
Author(s): Mark Culp*+ and George Michailidis
Companies: West Virginia University and University of Michigan
Address: , , ,
Keywords: multi-view learning ; co-training ; data fusion ; partial least squares ; random forest ; variabl importance
Abstract:

In several scientific applications, data are generated from two or more diverse sources (views) with the goal of predicting an outcome of interest. Often it is the case that the outcome is not associated with any single view. However, the synergy of all measurements from each view may yield a more predictive classifier. For example, consider a drug discovery application in which individual molecules are described partially by several assay screens based on diverse profiles and partially by their chemical structural fingerprint. In this talk, a co-training algorithm is developed to utilize data from diverse sources to predict the common class variable. Novel enhancements for variable importance, robustness to a mislabeled class variable, and a technique to handle unbalanced classes are applied to the motivating data set. Comparisons data fusion using PLS are assessed on real data.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2009 program


JSM 2009 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised September, 2008