Abstract:
|
Today we are awash in data, which are being collected at unprecedented scale and speed across a broad range of scientific and business applications. Big Data holds great promise for advances in science and business, but it also introduces unique computational and statistical challenges. This roundtable focuses on ways statisticians and biostatisticians can better embrace those challenges and make a real impact in Big Data applications. Topics in Big Data computing and statistics to be discussed include data collection (smart devices, the Internet), data storage and processing (Hadoop, Spark), parallelization (MapReduce, doParallel), and analytics (statistical machine learning, efficient computational algorithms).
|