All Times EDT

Abstract Details

Activity Number: 13 - A Multi-Disciplinary View of Reproducibility
Type: Invited
Date/Time: Sunday, August 7, 2022 : 2:00 PM to 3:50 PM
Sponsor: American Association for the Advancement of Science
Abstract #320392
Title: Containerization for Interactive and Reproducible Analysis
Author(s): Gregory J Hunt* and Johann A Gagnon-Bartsch
Companies: William & Mary and University of Michigan
Keywords: containerization; reproducibility; sharing; docker
Abstract:

Modern data analysis is typically quite computational. Correspondingly, sharing scientific and statistical work now often means sharing code and data in addition to writing papers and giving talks. This type of code sharing faces several challenges. For example, it is often difficult to take code from one computer and run it on another because of software configuration, version, and dependency issues. Even if the code runs, writing code that is easy to understand or interact with can be hard. This makes it difficult to assess third-party code and its findings, for example, in a review process. In this talk we describe a combination of two computing technologies that help make analyses shareable, interactive, and completely reproducible. These technologies are (1) analysis containerization, which leverages virtualization to fully encapsulate analysis, data, code, and dependencies into an interactive and shareable format, and (2) code notebooks, a literate programming format for interacting with analyses. This talk reviews the problems at a high level and provides concrete solutions to the challenges faced.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program