Abstract Details

Activity Number: 213233
Type: Professional Development
Date/Time: Saturday, July 30, 2016 : 8:30 AM to 5:00 PM
Sponsor: ASA
Abstract #321860
Title: A Primer to Web Scraping with R (ADDED FEE)
Author(s): Simon Munzert*
Companies: University of Konstanz
Keywords:
Abstract:

The web is full of data that are of great interest to scientists and businesses alike. Firms, public institutions, and private users provide every imaginable type of information, and new channels of communication generate vast amounts of data on human behavior. But how can one efficiently collect data from the Internet; retrieve information from social networks, search engines, and dynamic web pages; tap web services; and, finally, process the collected data with statistical software? This course covers the basics of web data collection with R. The sessions are hands-on: we will practice every step of the process in R using various examples. We will learn how to scrape content from static and dynamic web pages, connect to the APIs of popular web services such as Twitter to retrieve and process user data, and set up scrapers that run automatically. This course assumes prior experience using R. Please bring a laptop with the latest versions of R and RStudio installed. You'll be able to download accompanying slides, code, and data during the course.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2016 program

 
 
Copyright © American Statistical Association