Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 408 - Recent Advances in Statistical Machine Learning
Type: Topic Contributed
Date/Time: Wednesday, August 10, 2022 : 10:30 AM to 12:20 PM
Sponsor: Section for Statistical Programmers and Analysts
Abstract #320835
Title: Optimal and Safe Estimation for High-Dimensional, Semi-Supervised Learning
Author(s): Yang Ning*
Companies: Cornell University
Keywords: High dimensionality; Minimax optimality; semi-supervised learning
Abstract:

There are many scenarios such as the electronic health records where the outcome is much more difficult to collect than the covariates. In this paper, we consider the linear regression problem with such a data structure under the high dimensionality. Our goal is to investigate when and how the unlabeled data can be exploited to improve the estimation and inference of the regression parameters in linear models, especially in light of the fact that such linear models may be misspecified in data analysis. In particular, we address the following two important questions. (1) Can we use the labeled data as well as the unlabeled data to construct a semi-supervised estimator such that its convergence rate is faster than the supervised estimators? (2) Can we construct confidence intervals or hypothesis tests that are guaranteed to be more efficient or powerful than the supervised estimators?


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program