Online Program

All Times EDT

Thursday, October 1
Thu, Oct 1, 11:25 AM - 12:40 PM
Virtual
Concurrent Session

Transfer Learning in High-Dimensional Sparse Regression (308507)

Tony Cai, University of Pennsylvania 
Hongzhe Li, University of Pennsylvania 
*Sai Li, University of Pennsylvania 

Keywords: Machine learning, high-dimensional regression, minimax optimality

We consider estimation and prediction in high-dimensional linear regression in the setting of transfer learning, using samples from the target model as well as auxiliary samples from different but possibly related regression models. When some auxiliary samples are known to be "informative", we show that the minimax optimal rates for prediction and estimation are faster than the rates attainable without the auxiliary data. That is, knowledge from the informative auxiliary data can be transferred to improve learning performance on the target problem. When it is not known which auxiliary samples are informative, we propose a data-driven method for transfer learning (Trans-Lasso) and show that, under proper conditions, its performance is comparable to the oracle case in which the set of informative samples is known. The proposed approaches are demonstrated in various numerical studies and are applied to a dataset concerning the associations among gene expressions in a target tissue, with samples from multiple other tissues as auxiliary data.
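
The abstract does not spell out the estimator, so the following is only a minimal sketch of one plausible two-step scheme for the oracle case, where the informative auxiliary samples are assumed known: fit a Lasso on the pooled auxiliary and target samples, then fit a second Lasso on the target residuals to correct the bias of the pooled fit. The function names (`lasso_cd`, `transfer_two_step`), the tuning values, and the simulated data are all hypothetical choices made for illustration and should not be read as the authors' Trans-Lasso procedure.

```python
import random


def soft_threshold(z, t):
    """Soft-thresholding operator used in the Lasso coordinate update."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def lasso_cd(X, y, lam, n_iter=100):
    """Minimize (1/2n)||y - Xb||^2 + lam * ||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    b = [0.0] * p
    resid = list(y)  # equals y - Xb, since b starts at zero
    col_sq = [sum(row[j] ** 2 for row in X) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of column j with the partial residual (b[j] removed).
            rho = sum(X[i][j] * (resid[i] + X[i][j] * b[j]) for i in range(n)) / n
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            step = new_bj - b[j]
            if step != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * step
                b[j] = new_bj
    return b


def transfer_two_step(X_tgt, y_tgt, X_aux, y_aux, lam_pool, lam_corr):
    """Assumed two-step scheme: pooled Lasso fit, then a sparse correction on target data."""
    # Step 1: Lasso on auxiliary and target samples stacked together.
    w = lasso_cd(X_aux + X_tgt, y_aux + y_tgt, lam_pool)
    # Step 2: Lasso on the target residuals to estimate the contrast.
    r = [yi - sum(x * wj for x, wj in zip(row, w)) for row, yi in zip(X_tgt, y_tgt)]
    delta = lasso_cd(X_tgt, r, lam_corr)
    return [wj + dj for wj, dj in zip(w, delta)]


def simulate(n, coef, noise=0.5):
    """Draw n samples from a Gaussian linear model with the given coefficients."""
    X = [[random.gauss(0.0, 1.0) for _ in coef] for _ in range(n)]
    y = [sum(x * c for x, c in zip(row, coef)) + random.gauss(0.0, noise) for row in X]
    return X, y


# Toy data: the auxiliary coefficients differ from the target in one coordinate.
random.seed(0)
p = 20
beta = [1.0, 1.0, 1.0] + [0.0] * (p - 3)
beta_aux = [b + (0.2 if j == 0 else 0.0) for j, b in enumerate(beta)]
X_tgt, y_tgt = simulate(30, beta)
X_aux, y_aux = simulate(200, beta_aux)

beta_transfer = transfer_two_step(X_tgt, y_tgt, X_aux, y_aux, 0.05, 0.1)
beta_target_only = lasso_cd(X_tgt, y_tgt, 0.1)

l1_error = lambda est: sum(abs(e - b) for e, b in zip(est, beta))
print("l1 error, two-step transfer:", l1_error(beta_transfer))
print("l1 error, target-only Lasso:", l1_error(beta_target_only))
```

In this sketch the benefit of pooling depends on how sparse the contrast between the auxiliary and target coefficients is. Choosing the penalties `lam_pool` and `lam_corr` and deciding which auxiliary samples to pool are exactly the questions a data-driven method would need to address, and the values used here are arbitrary.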