Activity Number:
|
151
|
Type:
|
Topic Contributed
|
Date/Time:
|
Monday, August 3, 2009 : 10:30 AM to 12:20 PM
|
Sponsor:
|
Section on Bayesian Statistical Science
|
Abstract - #303797 |
Title:
|
Targeted Sequential Resampling from Large Data Sets in Mixture Modeling
|
Author(s):
|
Ioanna Manolopoulou*+
|
Companies:
|
Statistical and Applied Mathematical Sciences Institute
|
Address:
|
19 T.W. Alexander Drive, Durham, NC, 27709-4006,
|
Keywords:
|
Sequential Monte Carlo ; Rare event detection ; Large datasets ; Mixture models
|
Abstract:
|
One of the challenges of Markov Chain Monte Carlo in large data sets is the need to scan through the whole data at each iteration of the sampler, which can be computationally prohibitive. Several approaches have been developed to address this, typically drawing computationally manageable samples of the data. Here we consider the specific case when most of the data provides no information about the parameters of interest. The motivating application arises in flow cytometry, where interest lies in identifying specific rare cell subtypes and characterizing them according to their corresponding markers. We present a MCMC approach where an initial sample of the full data is used to draw a further set of datapoints which contains more information about rare events, and extend it to a Sequential Monte Carlo framework whereby the selected sample is augmented sequentially as estimates improve.
|