JSM Preliminary Online Program
This is the preliminary program for the 2009 Joint Statistical Meetings in Washington, DC.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Activity Number: 13
Type: Topic Contributed
Date/Time: Sunday, August 2, 2009 : 2:00 PM to 3:50 PM
Sponsor: Biometrics Section
Abstract - #304454
Title: Estimating Counts for Queries Without Accessing a Database
Author(s): Kyongryun Lee*+
Companies: Iowa State University
Address: Ames, IA 50011
Keywords: Bayesian Networks; Data Clustering; Expectation-Maximization; Mixture of Bayesian Networks
Abstract:

We address the problem of quickly estimating the results of count queries on a database without accessing the database at query time. To accomplish this, we build a model of the domain from the database in a preprocessing phase and use that model to answer count queries. The model we use is a Mixture of Bayesian Networks (MBN), which effectively encodes the joint probability distribution of the domain. An MBN is a weighted mixture model whose components are Bayesian networks. We describe how to learn an MBN from a database using a modified Expectation-Maximization (EM) algorithm, called the EAM algorithm, and evaluate its accuracy on real and artificial data sets. Experimental results show that MBNs can represent a data set satisfactorily and can approximate counts with a high degree of accuracy without accessing the database. We illustrate our method by applying it to a biological database.
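
As a rough illustration of the idea in the abstract (not the authors' implementation), a count query can be answered from a fitted mixture by multiplying the table size by the mixture probability of the query. The sketch below assumes a hypothetical two-variable domain with a network A -> B in each component. The weights, probabilities, table size, and all function names are invented for illustration, and the learning step (the EAM algorithm) is not shown.

```python
from itertools import product

# Hypothetical fitted mixture: two components, each a tiny Bayesian network A -> B
# over binary variables. All numbers are made up for illustration only.
COMPONENTS = [
    {"weight": 0.6, "p_a": 0.2, "p_b_given_a": {0: 0.1, 1: 0.7}},
    {"weight": 0.4, "p_a": 0.8, "p_b_given_a": {0: 0.5, 1: 0.9}},
]
N_ROWS = 10_000  # assumed table size, recorded once during preprocessing


def component_joint(comp, a, b):
    """P(A=a, B=b) under one component, using the factorization P(A) * P(B | A)."""
    p_a = comp["p_a"] if a == 1 else 1.0 - comp["p_a"]
    p_b1 = comp["p_b_given_a"][a]
    p_b = p_b1 if b == 1 else 1.0 - p_b1
    return p_a * p_b


def query_probability(query):
    """Mixture probability of a query such as {"A": 1} or {"A": 1, "B": 1}.

    Variables that the query does not mention are summed out.
    """
    total = 0.0
    for a, b in product((0, 1), repeat=2):
        if query.get("A", a) != a or query.get("B", b) != b:
            continue  # this assignment does not match the query
        total += sum(c["weight"] * component_joint(c, a, b) for c in COMPONENTS)
    return total


def estimate_count(query, n_rows=N_ROWS):
    """Estimated number of matching rows: table size times query probability."""
    return n_rows * query_probability(query)
```

With these made-up parameters, estimate_count({"A": 1, "B": 1}) comes to roughly 3720 rows and estimate_count({"B": 1}) to roughly 4600, and neither call touches the underlying table. Enumerating every assignment is only workable for very small domains; a realistic model would need proper Bayesian network inference in each component.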


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.



JSM 2009 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised September, 2008