JSM 2014 Home
Online Program Home
My Program

Abstract Details

Activity Number: 180
Type: Contributed
Date/Time: Monday, August 4, 2014 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract #312868
Title: Sanitation Inspector Allocation in San Francisco Using Yelp Reviews
Author(s): Guillaume Pouliot*+ and Michael Luca
Companies: and Harvard Business School
Keywords: latent Dirichlet allocation ; Bayesian Support Vector Machines ; Variational Inference ; Topic Models ; Supervised Learning
Abstract:

The city of San Francisco has a limited number of sanitation inspectors. It would be more efficient to send inspectors to restaurants that are more likely to get caught for sanitation norm violations. Our data set consists of a comprehensive list of San Francisco restaurants, their general characteristics and Yelp reviews (obtained from Yelp), as well as their health inspection records. The first-order task is to predict who will commit what infraction in the future. Many methods are compared. In particular, we develop a latent Dirichlet allocation (LDA) topic model supervised with multicategory Bayesian support vector machines. We compare performance of inference via Gibbs sampling and variational inference. We find that this method improves on LDA supervised with a generalized linear model.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2014 program




2014 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Professional Development program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.