Date/Time: Tuesday, July 31, 2018 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Computing
Title: Harnessing the power of open data on the web
Author(s): Karthik Ram* and Scott Chamberlain
Companies: University of California, Berkeley and University of California, Berkeley
Keywords: API ; data ; open science ; data science ; R

The Open Science movement, which promotes sharing all artifacts of scientific research, such as data, code, and methods, is becoming increasingly popular among researchers worldwide. This rapid rise in popularity is evident in the thriving ecosystem of tools and the numerous training initiatives that have sprung up at various institutions. One such effort is the rOpenSci project. Founded as a grassroots effort six years ago, the project works to promote open and reproducible research by enabling discovery of data, code, and software; creating the technical and social infrastructure to enable reproducible research; and advocating for a culture of open science in various disciplines. This community-driven project builds upon R's popularity as a scientific research tool and fills gaps in the computational pipeline by providing tools for data acquisition (from APIs and other sources), data manipulation (including an emerging suite for spatial data analysis and mapping), data visualization, and data publication. Beyond tooling, the project has been working towards promoting best practices in scientific computing (beyond the R language itself) by

