JSM 2013 Home

Abstract Details

Activity Number: 23
Type: Topic Contributed
Date/Time: Sunday, August 4, 2013 : 2:00 PM to 3:50 PM
Sponsor: Survey Research Methods Section
Abstract - #310420
Title: Parameter Estimation for Record Linkage
Author(s): Joshua Tokle*+
Companies: U.S. Census Bureau
Keywords: record linkage ; EM ; optimality
Abstract:

For each pair of records from two files, the Fellegi-Sunter model for record linkage provides a matching weight or score that is an aggregate of similarity scores for the fields being compared. The matching weight is the likelihood conditional on (typically unobserved) match status. In this talk, I focus on accurate estimates of these conditional probabilities. The probabilities themselves can vary greatly by data source and depend heavily on typographical error in quasi-identifying fields such as name, address, and date of birth. In the Decennial Census, computing these probabilities automatically, without training data, in more than 400 contiguous regions increases matching accuracy and reduces clerical review by as much as two-thirds. I will discuss the probability models that have been applied in this setting. In particular, I will discuss EM fitting with little or no training data, and possible alternative models that may not be as 'optimal' but can still work well.
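
As a rough illustration of EM fitting without training data, the sketch below estimates the conditional agreement probabilities from unlabeled record pairs. It is not the model used in the talk. It assumes the simplest textbook setup: each field comparison is reduced to agree/disagree (1/0), and fields are conditionally independent given match status. The names `fit_fs_em` and `match_weight`, and the symbols `p`, `m`, and `u`, are hypothetical and chosen for this example only.

```python
import math
from collections import Counter


def fit_fs_em(patterns, p=0.1, m_init=0.9, u_init=0.1, max_iter=500, tol=1e-7):
    """Estimate (p, m, u) by EM from 0/1 agreement patterns, with no labeled pairs.

    p    : proportion of record pairs that are true matches
    m[k] : P(field k agrees | pair is a match)
    u[k] : P(field k agrees | pair is a non-match)
    Assumes binary comparisons and conditional independence across fields.
    """
    counts = Counter(tuple(g) for g in patterns)  # collapse identical patterns
    n_pairs = sum(counts.values())
    n_fields = len(next(iter(counts)))
    m = [m_init] * n_fields
    u = [u_init] * n_fields

    def clip(x, eps=1e-6):
        # Keep probabilities strictly inside (0, 1) so the logs stay finite.
        return min(max(x, eps), 1.0 - eps)

    for _ in range(max_iter):
        # E-step: posterior probability of "match" for each distinct pattern.
        post = {}
        for g in counts:
            log_match, log_non = math.log(p), math.log(1.0 - p)
            for k, agree in enumerate(g):
                log_match += math.log(m[k] if agree else 1.0 - m[k])
                log_non += math.log(u[k] if agree else 1.0 - u[k])
            diff = log_non - log_match
            post[g] = 0.0 if diff > 700 else 1.0 / (1.0 + math.exp(diff))

        # M-step: posterior-weighted agreement frequencies.
        w_match = sum(counts[g] * post[g] for g in counts)
        w_non = n_pairs - w_match
        new_p = clip(w_match / n_pairs)
        new_m = [
            clip(sum(counts[g] * post[g] * g[k] for g in counts) / max(w_match, 1e-12))
            for k in range(n_fields)
        ]
        new_u = [
            clip(sum(counts[g] * (1.0 - post[g]) * g[k] for g in counts) / max(w_non, 1e-12))
            for k in range(n_fields)
        ]

        # Stop once no parameter moves by more than tol.
        change = max(
            [abs(new_p - p)]
            + [abs(a - b) for a, b in zip(new_m, m)]
            + [abs(a - b) for a, b in zip(new_u, u)]
        )
        p, m, u = new_p, new_m, new_u
        if change < tol:
            break
    return p, m, u


def match_weight(pattern, m, u):
    """Log2 likelihood-ratio weight for one agreement pattern."""
    return sum(
        math.log2(m[k] / u[k]) if agree else math.log2((1.0 - m[k]) / (1.0 - u[k]))
        for k, agree in enumerate(pattern)
    )
```

Starting with `m_init` above `u_init` nudges the first latent class toward "match". With a mixture model like this the class labels are otherwise arbitrary, so the fitted values should be checked for plausibility before the weights are used.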


Authors who are presenting talks have a * after their name.

