JSM 2012 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Abstract Details

Activity Number: 104
Type: Invited
Date/Time: Monday, July 30, 2012 : 8:30 AM to 10:20 AM
Sponsor: General Methodology
Abstract - #303476
Title: Increasing Innovation Using Data Mining Competitions
Author(s): Jeremy Howard*+
Companies: Kaggle
Address: 307/107 Beach St, Port Melbourne, International, 3207, Australia
Keywords: data mining ; machine learning ; competitions ; crowdsourcing ; predictive modelling
Abstract:

Kaggle is an online data analytics contest site that allows organizations to host predictive analytics competitions, much like the $1M Netflix Prize awarded in 2009. Recent contests have involved developing statistical procedures to predict the success of submitted grant applications, the severity of HIV progression, and when supermarket shoppers will next visit the store and how much they will spend.

In a typical contest, an organization posts a portion of a data set and challenges contestants to produce the most accurate predictions on a held-out sample. First-place prizes are typically in the thousands of dollars, although a currently running contest on predicting hospital lengths of stay will earn the winner $3M. This talk discusses Kaggle's role in creating an environment for influencing academic research through data mining competitions.
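
The contest mechanics described above (release part of a data set, score predictions on a held-out sample) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy data, the 30% held-out fraction, the use of RMSE as the accuracy metric, and the names `split_dataset`, `rmse`, and `leaderboard` are hypothetical and do not describe Kaggle's actual scoring system.

```python
import math
import random


def split_dataset(rows, held_out_fraction=0.3, seed=0):
    """Shuffle the rows and split them into a released portion and a held-out sample."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * (1 - held_out_fraction))
    return shuffled[:cut], shuffled[cut:]


def rmse(predictions, truths):
    """Root mean squared error; an assumed metric, lower is better."""
    squared = sum((p - t) ** 2 for p, t in zip(predictions, truths))
    return math.sqrt(squared / len(truths))


def leaderboard(submissions, held_out):
    """Rank contestants by their error on the held-out sample, best first."""
    truths = [y for _, y in held_out]
    scores = {name: rmse(preds, truths) for name, preds in submissions.items()}
    return sorted(scores.items(), key=lambda item: item[1])


# Toy data (hypothetical): y is roughly 2x plus Gaussian noise.
data_rng = random.Random(1)
rows = [(x, 2 * x + data_rng.gauss(0, 1)) for x in range(100)]

# The organizer releases one portion and keeps the rest hidden for scoring.
released, held_out = split_dataset(rows)

# Two hypothetical contestants submit predictions for the held-out inputs.
submissions = {
    "linear_model": [2 * x for x, _ in held_out],
    "constant_guess": [100.0 for _ in held_out],
}

ranking = leaderboard(submissions, held_out)
for name, score in ranking:
    print(f"{name}: RMSE = {score:.3f}")
```

In this sketch contestants never see the held-out outcomes, so a submission can only rank well by generalizing from the released portion.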


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

