Abstract:
|
Modern data analysis (aka "data science") builds upon foundational techniques from statistics, computer science and a host of other research fields and applications domains. While only the future will answer whether data science is an emerging new discipline, or simply a rebranding and repackaging of existing techniques, it is clear that the sheer scale and heterogeneity of newly available data is changing the character of data analytsis across many disciplines and industries. In this talk, I will describe, from a computer science perspective, a few of the emerging research issues. I will compare and contrast these with statistical questions, highlighting several areas where there are opportunities for fruitful collaborations. Scale and heterogeneity pose important challenges, and I will also discuss some potential pitfalls that both communities should work together to avoid.
|