Abstract:
|
Collaborative double robust targeted maximum likelihood estimator (C-TMLE) is an extension of targeted maximum likelihood estimators (TMLE) that pursues an optimal strategy for estimation of the nuisance parameter required in the targeting step. We consider the problem of estimation of the average causal effect based on an observational study in which we observe on each unit baseline covariates, a binary treatment and an outcome of interest. A forward stepwise variable selection procedure based on this latter criterion was proposed by van der Laan and Gruber (2010). This C-TMLE was shown to outperform a standard TMLE when there are variables that are highly correlated with treatment but non-predictive of the outcome. However, its computational complexity is quadratic in the number of variables, which makes this particular C-TMLE not scalable for large scale and high dimensional data. In this article, we propose several scalable versions of C-TMLE: instead of using a greedy search at each step, it follows an easy to compute data adaptive ordering of the variables. Simulations are provided to illustrate the performance of these scalable C-TMLEs relative to current competitors.
|