Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 53 - Applications of Data Linkage and Machine Learning Techniques
Type: Contributed
Date/Time: Monday, August 3, 2020 : 10:00 AM to 2:00 PM
Sponsor: Survey Research Methods Section
Abstract #310985
Title: Machine-Learning Algorithms to Improve Payment Imputation in the Medical Expenditure Panel Survey (MEPS)
Author(s): Emily Mitchell* and Chandler McClellan and Jerrod Anderson and Samuel H Zuvekas
Companies: Agency for Healthcare Research and Quality (AHRQ) and Agency for Healthcare Research and Quality (AHRQ) and Agency for Healthcare Research and Quality (AHRQ) and Agency for Healthcare Research and Quality (AHRQ)
Keywords: Machine Learning; Medical Expenditure Panel Survey; Imputation; Survey Data; Predictive Mean Matching
Abstract:

The Medical Expenditure Panel Survey (MEPS) is an annual survey that collects nationally representative data on healthcare use and expenditures for the civilian, non-institutionalized U.S. population and is the primary source for micro-level national data on medical expenditures. The MEPS contacts both households and their medical providers to gather as much accurate expenditure information as possible. However, a significant portion of this expenditure data must be imputed.

Currently, imputation is conducted using a predictive mean matching (PMM) algorithm in which a linear regression model predicts total expenditures for recipients and donors. Recipients and donors are matched based on the smallest distance between predicted values, and expenditures are then allocated to the recipient.

For this analysis, we assess whether more sophisticated machine-learning (ML) algorithms can improve the existing PMM process to impute total expenditures. We apply and compare supervised ML algorithms such as random forest, neural networks, and regularized regression. We also assess the possibility of adding additional features into the algorithms and weighting important features.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program