Friday, February 24
CS14 Addressing Statistical Problems and Issues Fri, Feb 24, 3:45 PM - 5:15 PM
River Terrace 3

Data Preparation: The Key for Meaningful Insights (303340)

*Huiyu Qian, AutoAnything Inc. 

Keywords: Data Preparation; Quality Control

Preparing the right data is the key to any valid data analysis or predictive modeling; otherwise the result is "Garbage In, Garbage Out". In this talk, I will walk through the steps and some methods for getting to know your source data and for validating and cleaning it, based on my years of working and mentoring experience in this area. Examples will include data formatting, transformation, merging and blending, quality control, and dealing with missing data and categorical data. This talk will be most helpful for those with little or no working experience in analyzing or modeling complicated data structures or big data.
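
As a rough illustration of three of the topics listed above (quality control, missing data, and categorical data), here is a minimal sketch using only the Python standard library. It is not taken from the talk: the records, the field names, and the helper functions `impute_median`, `one_hot`, and `check_unique` are all hypothetical, and median imputation and one-hot encoding are just one common choice for each problem.

```python
from statistics import median

# Hypothetical raw records (not from the talk); None marks a missing value.
rows = [
    {"id": 1, "price": 10.0, "color": "red"},
    {"id": 2, "price": None, "color": "blue"},
    {"id": 3, "price": 30.0, "color": "red"},
]

def check_unique(rows, key):
    """Quality-control check: True if `key` is unique across all rows."""
    keys = [r[key] for r in rows]
    return len(keys) == len(set(keys))

def impute_median(rows, field):
    """Fill missing values of `field` with the median of the observed values."""
    observed = [r[field] for r in rows if r[field] is not None]
    fill = median(observed)
    return [{**r, field: fill if r[field] is None else r[field]} for r in rows]

def one_hot(rows, field):
    """Replace a categorical field with one 0/1 indicator column per level."""
    levels = sorted({r[field] for r in rows})
    encoded = []
    for r in rows:
        new = {k: v for k, v in r.items() if k != field}
        for level in levels:
            new[f"{field}_{level}"] = int(r[field] == level)
        encoded.append(new)
    return encoded

# Validate first, then clean: impute the missing price, then encode color.
ids_ok = check_unique(rows, "id")
clean = one_hot(impute_median(rows, "price"), "color")
```

Running the uniqueness check before any transformation mirrors the order in the abstract: get to know and validate the source data first, then clean it.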