All Times EDT

Abstract Details

Activity Number: 245 - Bayesian Models for Clustering and Latent Allocation
Type: Contributed
Date/Time: Tuesday, August 9, 2022 : 8:30 AM to 10:20 AM
Sponsor: Section on Bayesian Statistical Science
Abstract #322266
Title: Supervised Bayesian Nonparametric Clustering Techniques for Survey Data
Author(s): Stephanie M. Wu* and Briana Joy Kennedy Stephenson
Companies: Harvard T.H. Chan School of Public Health and Harvard T.H. Chan School of Public Health
Keywords: Bayesian clustering; Survey weights; Bayesian nonparametrics; Dietary intake patterns; Cardiovascular disease

Dietary intake is a major modifiable risk factor for cardiovascular disease. We can characterize dietary intake patterns and their effects on risk of cardiovascular disease by using supervised Bayesian nonparametric clustering methods. However, when data are sourced from surveys where unequal probabilities of selection are inherent in the design, this complex survey design must be accounted for to avoid biased estimation and inference. Working from an overfitted finite mixture model framework, we explore two approaches that use sampling weights to adjust for survey design and apply them to a supervised clustering setting. The first approach replaces the likelihood with a weighted pseudo-likelihood in the posterior update. The second approach uses a weighted finite population Bayesian bootstrap to generate a pseudo-population, which is then integrated into the Markov chain Monte Carlo algorithm. Using categorical dietary consumption data and binary cardiovascular disease data from representative surveys, we apply these two methods and discuss their performance via simulation studies in an effort to better understand the impact of diet on cardiovascular disease risk.
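
As a rough illustration of the first approach, the sketch below shows what a weighted pseudo-likelihood update could look like for a single categorical variable with a Dirichlet prior. It is not the authors' implementation: the function name `weighted_dirichlet_posterior`, the flat prior, the toy data, and the choice to normalize the sampling weights so they sum to the sample size are all assumptions made for this example. Raising each observation's likelihood term to its normalized weight means the usual category counts in the conjugate update are replaced by weighted counts.

```python
import numpy as np


def weighted_dirichlet_posterior(x, weights, n_levels, prior=1.0):
    """Dirichlet posterior parameters under a weighted pseudo-likelihood.

    Hypothetical helper for illustration only. Each observation's
    categorical likelihood term is raised to its normalized sampling
    weight, so the conjugate update uses weighted category counts.
    """
    x = np.asarray(x)
    w = np.asarray(weights, dtype=float)
    # Assumption: rescale weights so they sum to the sample size n.
    w = w * len(w) / w.sum()
    # Weighted counts per category replace the unweighted counts.
    counts = np.bincount(x, weights=w, minlength=n_levels)
    return prior + counts


# Toy data (made up): 6 respondents, 3 response levels, unequal weights.
x = np.array([0, 0, 1, 2, 2, 2])
weights = np.array([10.0, 10.0, 40.0, 20.0, 20.0, 20.0])

post = weighted_dirichlet_posterior(x, weights, n_levels=3)

# One posterior draw of the category probabilities, as a sampler step might take.
rng = np.random.default_rng(0)
theta = rng.dirichlet(post)
```

In this toy example the respondent in level 1 carries a large weight, so that level receives more posterior mass than its single unweighted observation would give it. In a mixture model, the same weighted counts would be computed within each cluster at every iteration.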

Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program