Abstract Details

Activity Number: 8 - Machine Learning Methods and Applications: Making an Impact in Biomedical Research
Type: Invited
Date/Time: Sunday, July 28, 2019 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #300263 Presentation
Title: Matching Methods for Observational Data with Small Group Sizes and Missing Covariates
Author(s): Juanjuan Fan* and Afrooz Jahedi and Tristan Hillis and Ralph-Axel Mueller
Companies: San Diego State University and San Diego State University and San Diego State University and San Diego State University
Keywords: Observational Study; Matching; Random Forest; Propensity Score; Proximity; Missing data

To derive unbiased inferences from observational data, matching methods are often applied to produce treatment groups that are balanced on relevant background variables. Although many matching algorithms exist in the literature, most require a large control reservoir and cannot handle missing covariates. Random forest, which averages outcomes from many decision trees, is nonparametric, can handle missing data during tree building, and can produce more accurate and less model-dependent estimates of propensity scores, as well as a proximity matrix. In this study, iterative matching algorithms are developed to form balanced samples when sample sizes are limited in both groups. The question of how to evaluate sample balance in the presence of missing data is also investigated. The proposed methods are applied to two data sets, arising from studies of autism spectrum disorder (ASD) and student success.
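
To make the two ideas in the abstract concrete, the following is a minimal sketch of matching on a propensity score and then checking balance when some covariate values are missing. It is not the authors' iterative algorithm, which the abstract does not specify. The function names `greedy_match` and `smd`, the caliper rule, and the choice to compute balance from observed values only are all assumptions made for illustration. The propensity scores are taken as given; in the setting described above they would come from a random forest.

```python
import math
from statistics import mean, variance


def greedy_match(treated, control, caliper):
    """Pair each treated unit with the nearest unused control unit.

    treated, control: dicts mapping unit id -> propensity score.
    caliper: largest score difference allowed within a pair (assumed rule).
    Returns a list of (treated_id, control_id) pairs.
    """
    available = dict(control)  # controls not yet used in a pair
    pairs = []
    # Visit treated units from highest to lowest score; this ordering is
    # an illustrative choice, not something stated in the abstract.
    for t_id, t_score in sorted(treated.items(), key=lambda kv: -kv[1]):
        if not available:
            break
        # Nearest remaining control on the propensity score.
        c_id = min(available, key=lambda c: abs(available[c] - t_score))
        if abs(available[c_id] - t_score) <= caliper:
            pairs.append((t_id, c_id))
            del available[c_id]  # match without replacement
    return pairs


def smd(x_treated, x_control):
    """Standardized mean difference of one covariate.

    Missing values are encoded as None and skipped, so the statistic
    uses observed values only (an assumption, not the authors' method).
    Each group needs at least two observed values.
    """
    a = [v for v in x_treated if v is not None]
    b = [v for v in x_control if v is not None]
    pooled_sd = math.sqrt((variance(a) + variance(b)) / 2)
    return (mean(a) - mean(b)) / pooled_sd
```

A call such as `greedy_match(treated_scores, control_scores, caliper=0.1)` returns the matched pairs, and `smd` can then be evaluated for each covariate on the matched sample. With small groups, a tight caliper may leave some treated units unmatched, which is the kind of trade-off an iterative procedure would need to manage.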

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program