Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 67 - Advances in Variable Selection
Type: Contributed
Date/Time: Sunday, August 7, 2022 : 4:00 PM to 5:50 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #322222
Title: Sample Size Calibration by FDR-Power Tradeoff for Logistic Regression in High Dimensions
Author(s): Gerda Claeskens and Jing Zhou*
Companies: KU Leuven and KU Leuven
Keywords: high-dimensional statistics; logistic regression; FDR control; power analysis; variable selection; reproducibility
Abstract:

In recent years, false discovery rate (FDR) control has gradually attracted attention to improve the reproducibility of variable selection. We focus on the variable selection problem for $l_1$-regularized logistic regression with $p$ variables and $n$ samples. In addition, we assume $n$, $p$ follow a linear growth rate $n/p \to \delta \in (0, \infty)$ which include both $n>p$ and $n \leq p$ cases. Since the $l_1$-regularizer by nature performs variable selection, we characterize its asymptotic FDR-power tradeoff and classification accuracy using a system of equations with six parameters. Further, we propose a sample size calibration procedure to achieve certain power under pre-specified FDR using the FDR-power tradeoff. Similar asymptotic analysis for the model-X knockoff, which provides FDR controlled selection, is also investigated. We illustrate the FDR-power analysis and the corresponding sample size calibration using simulated and real data.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program