Online Program Home
My Program

Abstract Details

Activity Number: 212 - An Emerging Ecosystem for Data Science/Statistics Education
Type: Invited
Date/Time: Monday, July 30, 2018 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Education
Abstract #326743 Presentation
Title: Version Control: The Gain You Get for Your Pain
Author(s): Jennifer Bryan*
Companies: RStudio, University of British Columbia
Keywords: version control; data science; R; R markdown; Git; reproducibility

Version control is a system for managing the evolution of a set of files across different people, computers, and time. Its roots are in software development, but it is increasingly important in both the practice and teaching of data science. I'll give an accessible description of what version control is and what it feels like to use it. We'll compare and contrast this to alternatives such as collaboration via Google Drive. Version control is important for educators for at least two reasons. First, it facilitates the exchange of code-rich documents between instructor and student. Second, it is a valid learning objective in and of itself, since version control is widely used by potential employers. I'll provide general information, as well as specifics relevant to the statistical programming environment R, the RStudio IDE, Git, and the GitHub hosting service.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program