Abstract #301517


The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.





JSM 2002 Abstract #301517
Activity Number: 200
Type: Topic Contributed
Date/Time: Tuesday, August 13, 2002 : 10:30 AM to 12:20 PM
Sponsor: Section on Survey Research Methods*
Title: Methods for Record Linkage and Bayesian Networks
Author(s): William Winkler*+
Affiliation(s): U.S. Census Bureau
Address: 4600 Silver Hill Road, Washington, District of Columbia, 20233-9100, U.S.A.
Keywords: likelihood ratio; Bayesian nets; EM algorithm; data mining
Abstract:

Although terminology differs, there is considerable overlap between record linkage methods based on the Fellegi-Sunter model (JASA 1969) and Bayesian networks used in machine learning (Mitchell 1997). Both are based on formal probabilistic models that can be shown to be equivalent in many situations (Winkler 2000). When no missing data are present in the identifying fields and training data are available, both can efficiently estimate the parameters of interest. When missing data are present, the EM algorithm can be used for parameter estimation in Bayesian networks when there are training data (Friedman 1997) and in record linkage when there are no training data (unsupervised learning). EM and MCMC methods can be used to estimate error rates automatically in some record linkage situations (Belin and Rubin 1995, Larsen and Rubin 2001). Automatic error-rate estimation has generally not been addressed in the computer science literature. If there are interactions between variables, the parameters can still be estimated. For Bayesian networks, efficient automatic methods exist for determining the most important interactions between variables (e.g., Friedman 1997, 1999).
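
As a rough illustration of the unsupervised case described above, the following is a minimal sketch of EM estimation for a Fellegi-Sunter-style model. It assumes binary agree/disagree comparisons, no missing values, and conditional independence between fields (the same structure as a naive Bayes classifier with a latent match class). The function names, starting values, clamping constant, and simulated data are illustrative assumptions and are not taken from the paper.

```python
import math
import random

EPS = 1e-6  # keeps probabilities away from 0 and 1 (illustrative choice)


def _clamp(x):
    return min(max(x, EPS), 1.0 - EPS)


def simulate_patterns(n_pairs, match_rate, m_true, u_true, seed=0):
    """Draw hypothetical agreement patterns (1 = field agrees) for record pairs."""
    rng = random.Random(seed)
    patterns = []
    for _ in range(n_pairs):
        probs = m_true if rng.random() < match_rate else u_true
        patterns.append(tuple(int(rng.random() < q) for q in probs))
    return patterns


def fs_em(patterns, n_iter=100, p=0.1, m_init=0.9, u_init=0.1):
    """EM for a two-class mixture over agreement patterns, with no training labels.

    Returns (p, m, u): the estimated proportion of matches, P(agree | match)
    per field, and P(agree | non-match) per field.
    """
    k = len(patterns[0])
    n = len(patterns)
    m = [m_init] * k
    u = [u_init] * k
    for _ in range(n_iter):
        # E-step: posterior probability that each pair is a match.
        post = []
        for gamma in patterns:
            like_m, like_u = p, 1.0 - p
            for j, agree in enumerate(gamma):
                like_m *= m[j] if agree else 1.0 - m[j]
                like_u *= u[j] if agree else 1.0 - u[j]
            post.append(like_m / (like_m + like_u))
        # M-step: re-estimate parameters from the posterior weights.
        total = sum(post)
        p = _clamp(total / n)
        for j in range(k):
            agree_m = sum(g * gamma[j] for g, gamma in zip(post, patterns))
            agree_u = sum((1.0 - g) * gamma[j] for g, gamma in zip(post, patterns))
            m[j] = _clamp(agree_m / max(total, EPS))
            u[j] = _clamp(agree_u / max(n - total, EPS))
    return p, m, u


def match_weight(gamma, m, u):
    """Log2 likelihood ratio of a pattern under the match vs. non-match class."""
    return sum(
        math.log2(m[j] / u[j]) if agree else math.log2((1.0 - m[j]) / (1.0 - u[j]))
        for j, agree in enumerate(gamma)
    )
```

For example, calling `fs_em(simulate_patterns(5000, 0.1, [0.95, 0.9, 0.85, 0.9], [0.05, 0.1, 0.2, 0.1]))` yields estimates that can be passed to `match_weight` to rank comparison patterns by their likelihood ratio. Handling missing comparison values or interactions between fields would require a richer model than this sketch.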


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.



Revised March 2002