Abstract:
|
With the advent of large-scale and high-throughput data collection coupled with the creation and implementation of complex statistical algorithms for data analysis, the reproducibility of modern data analyses, meaning the ability of independent analysts to recreate the results claimed by the original authors using the original data and analysis techniques, has become an important topic of discussion. In this talk, I will discuss the origins of reproducible research, characterize the current status of reproducibility research, including in the life sciences and in public health. Finally, I will describe some best practices and efforts towards a path forward for improving both the reproducibility and replicability in statistics and data science in the future.
|