|
Activity Number:
|
255
|
|
Type:
|
Topic Contributed
|
|
Date/Time:
|
Tuesday, August 4, 2009 : 8:30 AM to 10:20 AM
|
|
Sponsor:
|
Section on Government Statistics
|
| Abstract - #303238 |
|
Title:
|
Truthing Production Data Capture
|
|
Author(s):
|
Brad Paxton*+ and Steven P. Spiwak and Douglass Huang
|
|
Companies:
|
ADI, LLC and ADI, LLC and ADI, LLC
|
|
Address:
|
200 Canal View Blvd., Rochester, NY, 14623,
|
|
Keywords:
|
Forms Processing ; Data Capture ; Data Quality
|
|
Abstract:
|
In order to really know how a production forms data capture system is doing, it has been customary to have keyers sample captured data fields and do "double key and verify" operations to determine the correct answers ("truth") of production data. In the system we call Production Data Quality, which will be used in the 2010 Census, we use software automation and good statistical design to reduce the human effort involved by as much as 40 times and obtain high quality "truth." Once the "truth" is known, the production data may be scored using whatever correctness criteria are appropriate for the application, for example, some type of a "soft match."
|
- The address information is for the authors that have a + after their name.
- Authors who are presenting talks have a * after their name.
Back to the full JSM 2009 program |