Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 414 - Risk Modeling and Regression Techniques
Type: Contributed
Date/Time: Thursday, August 12, 2021 : 2:00 PM to 3:50 PM
Sponsor: Biometrics Section
Abstract #317858
Title: Efficient Data Fusion
Author(s): Sijia Li*
Companies: University of Washington
Keywords: data fusion; semiparametric theory; transportability
Abstract:

We aim to make inferences about a smooth finite-dimensional parameter by fusing data from multiple sources together. Previous works have studied the estimation of a variety of parameters in similar data fusion settings, including the average treatment effect, optimal treatment rule, or average reward, with the majority of them merging one historical dataset with covariates, actions, and rewards and one dataset of the same covariates. In this work, we consider the general case where multiple datasets align with different parts of the distribution of the target population, for example, the conditional distribution of the reward given actions and covariates. We then examine potential gains in efficiency that can arise from fusing these datasets together in a single analysis, which are characterized by a reduction in the semiparametric efficiency bound. Our framework allows researchers to tackle data fusion problems in generality without limiting themselves to specific parameters, numbers of data sources, or particular data structures. In a variety of examples, we show marked improvements in efficiency from using our proposed estimators compared to natural alternatives.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program