Online Program Home
My Program

Abstract Details

Activity Number: 342 - SPEED: Sports to Fire: Fascinating Applications of Statistics
Type: Contributed
Date/Time: Tuesday, July 31, 2018 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Abstract #330569 Presentation
Title: Application of Email Spam Filtering Algorithms to SMS Data
Author(s): Yishu Xue*
Companies: University Of Connecticut
Keywords: Document classification; Feature extraction; Model tuning; Imbalanced data
Abstract:

Email spam filters have been universally applied in reality. Spams, however, are also sent via text messages. In this project, multiple popular algorithms for Email spam filtering are implemented on a Short Message Service (SMS) dataset to see if they will successfully identify spams as well. Two different methods for representing the dataset using matrices were attempted. In addition to utilizing only tokens, other characteristics of the message, such as proportion of numbers or capital letters, were explored. The final classification results are presented, and a few caveats when applying these algorithms will be discussed.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program