Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 434 - Recent Advances in Unlinked and Permuted Regression
Type: Topic-Contributed
Date/Time: Thursday, August 12, 2021 : 4:00 PM to 5:50 PM
Sponsor: IMS
Abstract #317090
Title: Multivariate Regression with Unknown Permutation
Author(s): Martin Slawski* and Bodhisattva Sen
Companies: George Mason University and Columbia University
Keywords: Record Linkage; Broken Sample Problem; Regression; Permutation; Optimal Transport
Abstract:

Standard regression setups take it for granted that the response variables are observed jointly with their corresponding predictor variables. However, in the case of asynchronous data collection responses and predictors are given in two separate files with limited information about which records belong to the same statistical unit. Such setting pertains to record linkage, data privacy, and various other applications in computer science and engineering. In this talk, we present a series of practical methods and accompanying theory on the setting in which predictors and responses are observed up to an unknown permutation that encodes the underlying correspondence, starting from multivariate linear regression and concluding with a specific notion of monotone functions arising in optimal transportation. Specifically, we uncover a “blessing of dimensionality” phenomenon that indicates that recovery of the permutation becomes easier as the dimension of the response increases.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program