Abstract:
|
Dedicated training was designed and offered annually to introduce diverse cohorts of students and early-career scientists to first principles and concepts from data analysis, while also working within interdisciplinary teams. Participants completed a pre-workshop online four-week Introduction to R course. The week-long workshop emphasized hands-on tutorials with techniques for data wrangling and visualization including data scraping, parsing, cleaning, and analysis while also fostering interdisciplinary team science. Diverse backgrounds and experience were prioritized during the selection of participants, along with disciplinary interests from the full spectrum of STEM disciplines and beyond. Teams were organized around real-world, data-driven research projects. Students from statistics, math, and computer science domains were matched with students from engineering, life sciences, and liberal arts. Multi-institutional interdisciplinary teams received funds for continuing collaborative research with the goal of co-publishing results. Outcomes demonstrate that participants gain tangible data science skills and knowledge. Further, the interdisciplinary team experiences result in successful long-term student collaborations across institutions and topic domains at the nexus of data science.
|