JSM 2014 Home
Online Program Home
My Program

Abstract Details

Activity Number: 301
Type: Contributed
Date/Time: Tuesday, August 5, 2014 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Mining
Abstract #313622
Title: Classification with Known Class Probabilities
Author(s): Joshua Magarick*+
Companies: University of Pennsylvania
Keywords: Sequential Prediction ; Machine Learning ; Classification
Abstract:

We investigate the problem of classification when the marginal distribution over classes is known but varies substantially from sample to sample. This will occur when the data are not independent. In particular, this situation arises when the classes are generated from a Markov process with a known stationary distribution and unknown other parameters. In real data, this phenomenon has been observed in studies of sleep, where the amount of time spent in different sleep stages is well known overall but variable across individuals and studies. We show how to use this additional information in several classifiers, such as logistic regression to mitigate the effect of inter-sample class proportion variability using both simulation studies and real data.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2014 program




2014 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Professional Development program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.