Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 165 - SLDS CSpeed 2
Type: Contributed
Date/Time: Tuesday, August 10, 2021 : 10:00 AM to 11:50 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #318821
Title: Identifying Invariant Factors Across Multiple Environments with Kullback-Leibler Regression
Author(s): Jaime Roquero Gimenez* and James Zou
Companies: Stanford University and Stanford University
Keywords: Causal inference; Multiple environment
Abstract:

Many datasets are collected from multiple environments (e.g. different labs, perturbations, etc.), and it is often advantageous to learn models and relations that are invariant across environments. Invariance can improve robustness to unknown confounders and improve generalization to new domains. We develop a novel framework that we term Kullback-Leibler regression (KL regression) to reliably estimate regression coefficients in a challenging multi-environment setting, where latent confounders affect the data from each environment. KL regression is based on a new objective of simultaneously minimizing the sum of Kullback-Leibler divergences between a parametric model and the observed data in each environment, and we derive an analytic solution for its global optimum. We prove that KL regression recovers the true invariant factors under a flexible confounding setup. Extensive experiments show that KL regression performed better than state-of-the-art causal inference techniques across a variety of settings, even with large model mismatch. Moreover KL regression achieved the top score on a DREAM5 challenge for inferring causal genes.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program