Online Program Home
My Program

Abstract Details

Activity Number: 294 - SPEED: Statistical Learning and Data Science Speed Session 2, Part 1
Type: Contributed
Date/Time: Tuesday, July 30, 2019 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #305059 Presentation
Title: Block-Wise Partitioning for Extreme Multi-Label Classification
Author(s): Yuefeng Liang* and Thomas C. M. Lee and Cho-Jui Hsieh
Companies: UC Davis and UC Davis and UCLA
Keywords: multi-label classification

Extreme multi-label classification aims to learn a classifier that annotates an instance with a relevant subset of labels from an extremely large label set. Many existing solutions embed the label matrix to a low-dimensional linear subspace, or examine the relevance of a test instance to every label via a linear scan. In practice, however, those approaches can be computationally exorbitant. To alleviate this drawback, we propose a Block-wise Partitioning (BP) pretreatment that divides all instances into disjoint clusters, to each of which the most frequently tagged label subset is attached. One multi-label classifier is trained on one pair of instance and label clusters, and the label set of a test instance is predicted by first delivering it to the most appropriate instance cluster. Experiments on benchmark multi-label data sets reveal that BP pretreatment significantly reduces prediction time, and retains almost the same level of prediction accuracy.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program