Abstract:
|
Existing literature on constructing optimal regimes often focuses on intention-to-treat analyses that completely ignore the compliance behavior of individuals. Instrumental variable-based methods have also been developed to learn optimal regimes under endogeneity. However, when there are two active treatment arms, the average causal effects of treatments cannot be identified using instrumental variable methods, and thus the existing methods will not be applicable. To fill this gap, we provide a procedure that identifies an optimal regime and the corresponding value function as a function of a vector of sensitivity parameters. We also derive the canonical gradient of the target parameter and propose a multiply robust classification-based estimator of the optimal regime. Our simulations highlight the need for and usefulness of the proposed method in practice.
|