Online Program Home
My Program

Abstract Details

Activity Number: 47
Type: Invited
Date/Time: Sunday, July 31, 2016 : 4:00 PM to 5:50 PM
Sponsor: Section on Statistical Computing
Abstract #318293 View Presentation
Title: Thinking with Data Using R and RStudio: Powerful Idioms for Analysts
Author(s): Nicholas Jon Horton* and Randall Pruim and Daniel Kaplan
Companies: Amherst College and Calvin College and Macalester College
Keywords: statistical computing ; data science ; big data ; algorithmic thinking ; statistical education ; RStudio

Statisticians and data scientists need to be able to "think with data" in order to answer statistical questions that arise from the flood of data that are now available. In this talk, I will introduce a set of key idioms due to Hadley Wickham that provide a framework to teach data management skills and facilitate loading, merging, and transforming large datasets.

This talk will demonstrate these idioms implemented in new packages in R (namely readr, dplyr, haven, lubridate, mosaic, rvest, stringr, and tidyr) to ingest, manage, transform, analyze, and model data. You'll see that it is easy to learn to use these packages, and that it is very worthwhile to do so. The talk provides a headstart on learning, then points out the next steps. No prior experience with R is expected.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2016 program

Copyright © American Statistical Association