Online Program

Return to main conference page
Saturday, February 16
Sat, Feb 16, 8:00 AM - 9:15 AM
St. James Ballroom
Poster Session 3 and Continental Breakfast

Developing and Deploying Reproducible In-House R Packages: A Non-Packrat Workflow (303828)

View Presentation View Presentation

*Eric Tesdahl, SpecialtyCare 

Keywords: R, R package development, reproducibility

Relative to traditionally-used enterprise tools for data analysis, the R statistical computing environment with its framework of packages, affords a number of impressive capabilities for the creation of analytic tools that are transparent, well-documented, and easily-disseminated. However, R’s lack of an explicit package versioning mechanism puts the need for reproducibility of analytic findings in direct conflict with the continuous development and improvement of package functions. Packrat, the most commonly used tool to address reproducibility of R packages, unfortunately introduces significant resource and logistical constraints for most analytic workflows, and does not lend itself well to use by novice-level R users. This presentation describes one response to these challenges: a workflow for developing and distributing in-house R packages, which employs an explicit versioning system to ensure reproducible outputs, even as package code is updated over time.