All Times EDT

Abstract Details

Activity Number: 425 - Modern Statistical Learning of Complex Data
Type: Invited
Date/Time: Thursday, August 12, 2021 : 4:00 PM to 5:50 PM
Sponsor: Section on Nonparametric Statistics
Abstract #316850
Title: Kernel-Based Learning for Informative Selection in Complex Surveys
Author(s): Jay Breidt* and Teng Liu
Companies: Colorado State University and Colorado State University
Keywords: informative selection; maximum mean discrepancy; survey weighting; nonparametric test
Abstract:

Informative selection, in which the distribution of response variables given that they are sampled is different from their distribution in the population, is pervasive in complex surveys. Failing to take such informativeness into account can produce severe inferential errors, including biased and inconsistent estimation of population parameters. While several parametric procedures exist to test for informative selection, these methods are limited in scope and their parametric assumptions are difficult to assess. Motivated by a kernel-based learning method that compares distributions based on their maximum mean discrepancy, we develop a class of nonparametric tests for informative selection that compares weighted and unweighted distributions. The asymptotic distributions of the test statistics are established under the null hypothesis of noninformative selection. Simulation results show that our tests have power competitive with existing parametric tests in a correctly specified parametric setting, and better than those tests under model misspecification. Our approach adapts automatically to multidimensional settings. A recreational angling application illustrates the methodology.
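To make the idea of comparing weighted and unweighted distributions concrete, here is a minimal sketch in Python. It is not the authors' test statistic, which the abstract does not specify. It assumes a Gaussian kernel with a fixed bandwidth and uses the plain (biased) plug-in form of the squared maximum mean discrepancy between the weight-normalized empirical distribution and the equal-weight empirical distribution of the same sample. The function name, the bandwidth default, and the toy selection mechanism are all hypothetical choices made for illustration.

```python
import numpy as np


def weighted_vs_unweighted_mmd2(y, w, bandwidth=1.0):
    """Squared MMD between the weighted and unweighted empirical
    distributions of a sample (illustrative plug-in form only).

    y         : array of shape (n,) or (n, d), sampled responses
    w         : array of shape (n,), positive survey weights
    bandwidth : Gaussian kernel bandwidth (an assumed tuning choice)
    """
    y = np.asarray(y, dtype=float)
    if y.ndim == 1:
        y = y[:, None]                      # treat as n points in 1 dimension
    w = np.asarray(w, dtype=float)
    n = y.shape[0]

    p = w / w.sum()                         # weighted empirical distribution
    q = np.full(n, 1.0 / n)                 # unweighted empirical distribution

    # Gaussian kernel matrix from pairwise squared Euclidean distances
    sq_dist = np.sum((y[:, None, :] - y[None, :, :]) ** 2, axis=-1)
    K = np.exp(-sq_dist / (2.0 * bandwidth ** 2))

    # MMD^2 between two distributions supported on the same points:
    # (p - q)^T K (p - q)
    d = p - q
    return float(d @ K @ d)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    y_pop = rng.normal(size=5000)

    # Hypothetical informative design: inclusion probability grows with y
    pi_inf = 0.02 + 0.16 / (1.0 + np.exp(-y_pop))
    s_inf = rng.random(y_pop.size) < pi_inf
    print("informative:   ",
          weighted_vs_unweighted_mmd2(y_pop[s_inf], 1.0 / pi_inf[s_inf]))

    # Hypothetical noninformative design: unequal probabilities unrelated to y
    pi_non = rng.uniform(0.02, 0.18, size=y_pop.size)
    s_non = rng.random(y_pop.size) < pi_non
    print("noninformative:",
          weighted_vs_unweighted_mmd2(y_pop[s_non], 1.0 / pi_non[s_non]))
```

The value is exactly zero when all weights are equal and positive otherwise, so on its own it is only a descriptive discrepancy. Turning it into a test requires a reference distribution under noninformative selection, which is what the asymptotic results described in the abstract provide and which this sketch does not attempt to reproduce. Because the kernel acts on pairwise distances, the same code accepts multidimensional responses without modification.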


Authors who are presenting talks have a * after their name.
