JSM Preliminary Online Program
This is the preliminary program for the 2006 Joint Statistical Meetings in Seattle, Washington.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2006 Program page




Activity Number: 533
Type: Topic Contributed
Date/Time: Thursday, August 10, 2006 : 10:30 AM to 12:20 PM
Sponsor: Section on Survey Research Methods
Abstract - #306251
Title: Automatically Estimating Record Linkage False Match Rates
Author(s): William E. Winkler*+ and William E. Yancey
Companies: U.S. Census Bureau and U.S. Census Bureau
Address: Statistical Research Division, Washington, DC, 20233-9100,
Keywords: EM algorithm ; unsupervised learning ; semi-supervised learning
Abstract:

This paper provides a mechanism for automatically estimating record linkage false match rates in situations where the subset of the true matches is reasonably well-separated from other pairs. The method provides an alternative to the method of Belin and Rubin (1995) and is applicable in more situations. We provide examples demonstrating why the general problem of error rate estimation (both false match and false nonmatch rates) is likely impossible in situations without training data and exceptionally difficult in the extremely rare situations where training data are available.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2006 program

JSM 2006 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised April, 2006