JSM 2012 Home

JSM 2012 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Online Program Home

Abstract Details

Activity Number: 672
Type: Contributed
Date/Time: Thursday, August 2, 2012 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #304720
Title: Creating an Automated Industry and Occupation-Coding Process for the American Community Survey
Author(s): Michael Kornbau*+ and Julie Vesely and Matthew Thompson
Companies: U.S. Census Bureau and U.S. Census Bureau and U.S. Census Bureau
Address: 4700 Silver Hill Road, Suitland, MD, 20746,
Keywords: Logistic regression ; American Community Survey ; automated coding ; industry coding ; occupation coding
Abstract:

Every year the American Community Survey (ACS) collects data on millions of individuals. In particular, data is collected on the industry and occupation in which individuals work. This data, however, is collected in the form of write-ins. In order to produce estimates using this data, the industry and occupation write-ins must be assigned 4-digit codes indicating a specific industry or occupation. The coding of industry and occupation for the ACS is a massive operation. Every year over 2 million industry and occupation write-ins are assigned census codes, and this number continues to grow. Each of these cases is reviewed by a clerk and assigned a code. To reduce costs, a process was developed to assign industry and occupation codes using the write-in fields and a logistic regression model was created to determine the best code.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2012 program




2012 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.