JSM 2012 Home

JSM 2012 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Online Program Home

Abstract Details

Activity Number: 658
Type: Contributed
Date/Time: Thursday, August 2, 2012 : 10:30 AM to 12:20 PM
Sponsor: Biometrics Section
Abstract - #304715
Title: A Comparison of Strategies to Handle Missing Data in Random Forests
Author(s): Ren He and Christina Ramirez Kitchen and Anna Liza Antonio*+
Companies: University of California at Los Angeles Fielding School of Public Health and University of California at Los Angeles Fielding School of Public Health and University of California at Los Angeles Fielding School of Public Health
Address: Department of Biostatistics, Los Angeles, CA, 90095, United States
Keywords:
Abstract:

In recent years, Random Forests have become a very popular tool used in various fields of research (e.g. bioinformatics, financial services and pattern recognition). The attractiveness of this algorithm lies in its ability to deal with high-dimensional problems which include complex interaction effects. However, missing values may affect the efficiency of Random Forests. In this paper, we compare several strategies to handle missing data under a variety of simulation settings, which include different missing data mechanisms, missing rates and correlation structures. The simulation results are presented in this paper.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2012 program




2012 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.