Online Program Home
  My Program

Abstract Details

Activity Number:
Register
214069 - Preparing Statistician/Statistics Graduates to Be Data Scientists (ADDED FEE)
Type: Professional Development
Date/Time: Sunday, July 30, 2017 : 8:30 AM to 5:00 PM
Sponsor: ASA
Abstract #325472
Title: Preparing Statistician/Statistics Graduates to Be Data Scientist (ADDED FEE)
Author(s): Ming Li* and Hui Lin*
Companies: Amazon and TAMU - Commerce and DuPont Pioneer
Keywords:
Abstract:

With the recent big data revolution, enterprises ranging from FORTUNE 500 to startup companies across the US are hungry for Data Scientist to bring valuable insight from data collected. Statistics graduate students are great data scientist candidates, but there are relatively few data scientist with statistics education background. In this CE course, we will go through the needed data science knowledge and skills to prepare statistician to be excellent data scientist. The cloud-based computation environment will be used in this course with plenty of hands-on exercises. We will use case studies to cover how to leverage big data distributed platform (Hadoop / Hive / Spark), data wrangling, modeling, dynamic report (R markdown) and interactive dashboard (R-Shiny) to tackle real-world data science problems. Data ETL in production environments is one typical gap for statistician and it will be covered. Data science is a combination of science and art with data as the foundation. We will also cover the "art" part to guide participant to learn soft skills to define data science problems and to effectively communicate with business stakeholders. The prerequisite knowledge is MS level education in statistics and entry level of R-Studio.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association