JSM 2016 Online Program

Abstract Details

Activity Number:	127
Type:	Contributed
Date/Time:	Monday, August 1, 2016, 8:30 AM to 10:20 AM
Sponsor:	Section on Statistical Computing
Abstract #321492
Title:	Broom: An R Package for Converting Statistical Modeling Objects Into Tidy Data Frames
Author(s):	David G. Robinson*
Companies:	Stack Overflow
Keywords:	R; computing; tidy; software; modeling
Abstract:	The concept of "tidy data" offers a powerful and intuitive framework for structuring data to ease manipulation, modeling, and visualization, and has guided the development of R tools such as ggplot2, dplyr, and tidyr. However, most functions for statistical modeling, both built-in and in third-party packages, produce output that is not tidy and is therefore difficult to reshape, recombine, and otherwise manipulate. I introduce the R package "broom," which turns the output of model objects into tidy data frames that are suited to further analysis and visualization with input-tidy tools. The package defines the tidy, augment, and glance methods, which summarize a model at three levels of tidy output: the component level (tidy), the observation level (augment), and the model level (glance). These three levels can be used to describe many kinds of statistical models, and offer a framework for combining and reshaping analyses using standardized methods. Together with the R implementation in the broom package, this framework offers a grammar for describing the output of statistical models that can be applied across many statistical programming environments, including databases and distributed applications.
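
The three levels of output described in the abstract can be illustrated with a minimal sketch. It assumes the broom package is installed; the linear model and the built-in mtcars data set are chosen purely for illustration and do not appear in the abstract.

```r
# Assumes the broom package is installed: install.packages("broom")
library(broom)

# Illustrative model (an assumption, not from the abstract):
# fuel economy as a linear function of car weight
fit <- lm(mpg ~ wt, data = mtcars)

# Component level: one row per model term (estimate, std.error, ...)
component_level <- tidy(fit)

# Observation level: one row per observation, with fitted values
# and residuals added as extra columns
observation_level <- augment(fit)

# Model level: a single row of whole-model summaries (r.squared, AIC, ...)
model_level <- glance(fit)

print(component_level)
print(head(observation_level))
print(model_level)
```

Because each result is an ordinary data frame, it can be passed directly to tools such as dplyr or ggplot2.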

Authors who are presenting talks have a * after their name.

Copyright © American Statistical Association