Online Program Home
My Program

Abstract Details

Activity Number: 139 - Competing Effectively: Hosting, Designing, and Participating in Kaggle-Style Competitions
Type: Invited
Date/Time: Monday, July 30, 2018 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistics in Defense and National Security
Abstract #326788 Presentation
Title: General Techniques for Successful Data Science Competitions
Author(s): Ian Michael Mouzon*
Companies: Iowa State University
Keywords: competitions; machine learning; data mining; management; data exploration; teamwork

While most participants in a data science competition will not see their efforts rewarded by the obvious motivations, the difficult nature of the underlying problem causes many competitors to find a stronger motivation in the opportunity to develop a novel solution. Since obvious approaches to these problems do not work, successful results are based on the creativity of the competitors, their unique areas of expertise, and their ability to capture the less obvious features in the data. However, the goal of developing such a unique model, combined with the limited amount of time in which to work, greatly disrupts the normal routine of statistical modeling. Having participated in many data mining competitions, I have had the opportunity to notice some common techniques which allow competitors to build successful models, as well as understandable mistakes that lead to less successful results. By avoiding the traps that limit exploration and successfully shifting focus between the larger and smaller details, competitors can recognize important frameworks upon which to best drape and model the data.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program