Online Program Home
  My Program

Abstract Details

Activity Number: 440 - Causal Inference as an Essential Part of the Foundation of Statistical Education
Type: Invited
Date/Time: Wednesday, August 2, 2017 : 8:30 AM to 10:20 AM
Sponsor: Section on Teaching of Statistics in the Health Sciences
Abstract #322110 View Presentation
Title: Causal Inference in a Big Data World - The Roadmap
Author(s): Laura B Balzer*
Companies: Harvard T.H. Chan School of Public Health
Keywords: Causal inference ; Targeted maximum likelihood estimation (TMLE) ; Machine learning ; Semi-parametric inference ; HIV ; Big Data

Along with its many challenges, Big Data present exciting opportunities to better understand risk factors, build improved predictors, and elucidate causal relationships. Still, there are many sources of association between two variables: direct effects, indirect effects, measured confounding, unmeasured confounding, and selection bias. Methods to delineate causation from correlation are perhaps more pressing now than ever. We introduce a roadmap for translating a causal query into a statistical analysis: 1) clear statement of the scientific question; 2) definition of the causal model and causal parameter; 3) assessment of identifiability - linking the causal effect to a parameter estimable from the observed data distribution; 4) choice and implementation of estimators including parametric and semi-parametric approaches; and 5) interpretation of findings. This framework assures the parameters being estimated match the questions posed, elaborates what assumptions are necessary to interpret an estimate causally, and when the assumptions are not met, provides guidance on future research. These concepts are illustrated with an application to HIV prevention and treatment in East Africa.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

Copyright © American Statistical Association