Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 188 - SLDS Student Paper Awards
Type: Topic Contributed
Date/Time: Monday, August 8, 2022 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #322845
Title: Crowdsourcing Utilizing Subgroup Structure of Latent Factor Modeling
Author(s): Qi Xu* and Yubai Yuan and Junhui Wang and Annie Qu
Companies: Unviersity of California Irvine and UC Irvine and City university of Hong Kong and UC Irvine
Keywords: Angle-based approach; Low rank approximation; Multi-centroid grouping penalty; Orthogonal transformation ; Subgroup structure
Abstract:

Crowdsourcing has emerged as an alternative solution for collecting large scale la- bels. However, the majority of recruited workers are not domain experts, so their contributed labels could be noisy. In this paper, we propose a two-stage model to pre- dict the true labels for binary and multicategory classification tasks in crowdsourcing. In the first stage, we fit the observed labels with a latent factor model and incorpo- rate subgroup structures for both tasks and workers through a multi-centroid grouping penalty. Group-specific rotations are introduced to align workers with different task categories to solve multicategory crowdsourcing tasks. In the second stage, we pro- pose an angle-based approach to identify high-quality worker subgroups who are relied upon to assign labels to tasks. In theory, we show the estimation consistency of the latent factors and the prediction consistency of the proposed method. The simulation studies show that the proposed method outperforms the existing competitive methods, assuming the subgroup structures within tasks and workers. We also demonstrate the application of the proposed method to real world problems and show its superiority


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program