Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 288 - SLDS CSpeed 5
Type: Contributed
Date/Time: Wednesday, August 11, 2021 : 1:30 PM to 3:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #318640
Title: WITHDRAWN: A Virtual Multi-Label Approach to Imbalanced Data Classification
Author(s): Elizabeth Chou
Companies: Dept. of Statistics, National Chengchi University
Keywords: imbalance; classification; virtual multi-label
Abstract:

One of the most challenging studies in machine learning is imbalanced data analysis. Usually, in this type of research, it is more critical to predict minority class correctly than to majority class. However, traditional machine learning techniques are easy to cause such learning bias. Some ensemble methods cause various problems, such as over-fitting, disregard some information, or long computation time. Besides, the methods do not apply to all kinds of datasets. Based on the problem above, the virtual labels approach for the majority class is proposed to solve the imbalanced problem. A new multiclass classification approach with the equal K-means clustering method is performed in the study. The proposed method is compared with the commonly used imbalance problem methods, such as sampling methods (oversampling, undersampling, and SMOTE) and classifier methods (SVM and One-Class SVM). The result shows that the proposed method will have better performance when the degree of data imbalance increases and will gradually outperform other methods.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program