Abstract Details
Activity Number:
|
634
|
Type:
|
Topic Contributed
|
Date/Time:
|
Thursday, August 7, 2014 : 10:30 AM to 12:20 PM
|
Sponsor:
|
Government Statistics Section
|
Abstract #312067
|
|
Title:
|
Automated Coding of Worker Injury Narratives
|
Author(s):
|
Alexander Measure*+
|
Companies:
|
|
Keywords:
|
machine learning ;
statistical learning ;
natural language processing ;
text classification ;
logistic regression ;
support vector machines
|
Abstract:
|
Much of the information about work related injuries and illnesses in the U.S. is recorded only as short text narratives on Occupational Safety and Health Administration (OSHA) logs and Worker's Compensation records. Analysis of these data has the potential to answer many important questions about workplace safety, but typically requires that the individual cases be "coded" first to indicate their specific characteristics. Unfortunately the process of assigning these codes is often manual, time consuming, and prone to human error.
This paper compares manual and automated approaches to assigning detailed occupation, nature of injury, part of body, event resulting injury, and source of injury codes to narratives collected through the Survey of Occupational Injuries and Illnesses, an annual survey of U.S. establishments that collects OSHA logs describing approximately 300,000 work related injuries and illnesses each year. We demonstrate that machine learning coders based on the logistic regression and support vector machine algorithms outperform those based on naïve Bayes, and achieve coding accuracies comparable to or better than trained human coders.
|
Authors who are presenting talks have a * after their name.
Back to the full JSM 2014 program
|
2014 JSM Online Program Home
For information, contact jsm@amstat.org or phone (888) 231-3473.
If you have questions about the Professional Development program, please contact the Education Department.
The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.
Copyright © American Statistical Association.