Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 325 - Machine Learning Methods for Better-Informed Decision-Making in Heath Care
Type: Invited
Date/Time: Thursday, August 12, 2021 : 10:00 AM to 11:50 AM
Sponsor: Health Policy Statistics Section
Abstract #316709
Title: Multiple Instance Learning with Distributional Data: Application to Early-Stage Breast Cancer Diagnosis
Author(s): Menggang Yu*
Companies: University of Wisconsin
Keywords: Multiple instance learning; Support Vector Machine; Optimization; Classification

This talk is motivated by a recent cancer micro-environment study for breast cancer. In the study, tumor and adjacent normal tissues from cancer patients were collected. Then several spots of interest were selected on a tissue slide. Fibers of each spot on a slide were measured using a customized software and for each fiber, 19 measurements including length, curvature, etc were collected. In the dataset, each slide contained a maximum of 8 spots and each spot had hundreds of fibers. As the number of fibers of each spot is large, it is reasonable to view the data of each spot as an empirical distribution of fiber features. The challenge is that the label for each spot may be unobserved. If a slide is 'tumor' then we know at least one spot on this slide should be labeled 'tumor'. If a slide is 'normal', then all the spots of this slide should be labeled 'normal'. Therefore, for a 'tumor' slide, we were not able to observe spot labels. But for a 'normal' slide, we observed all spot labels. We consider developing classification rules based on such data.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program