JSM 2005 - Toronto

Abstract #302778

This is the preliminary program for the 2005 Joint Statistical Meetings in Minneapolis, Minnesota. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2005); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.



The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


The Program has labeled the meeting rooms with "letters" preceding the name of the room, designating in which facility the room is located:

Minneapolis Convention Center = “MCC” Hilton Minneapolis Hotel = “H” Hyatt Regency Minneapolis = “HY”

Back to main JSM 2005 Program page



Legend: = Applied Session, = Theme Session, = Presenter
Activity Number: 377
Type: Invited
Date/Time: Wednesday, August 10, 2005 : 10:30 AM to 12:20 PM
Sponsor: SSC
Abstract - #302778
Title: LAGO: A Computationally Efficient Approach for Statistical Detection
Author(s): Mu Zhu*+
Companies: University of Waterloo
Address: , Waterloo, ON, N2L 3G1, Canada
Keywords: Average precision ; Drug discovery ; Kernel density estimator ; Nearest neighbor ; Radial basis function (RBF) network ; Support vector machine (SVM)
Abstract:

In drug discovery, chemists are faced with the problem of having to screen a large pool of candidates and identify a small number of chemical compounds worthy of further investigation. The drug discovery problem is an example of a more general class of statistical detection problems where the underlying objective is to detect items belonging to a rare class from a large database. We propose a computationally efficient method to achieve this goal. Our method consists of two steps. First, we estimate the density function of the rare class alone with an adaptive bandwidth kernel density estimator. The adaptive choice of the bandwidth is inspired by the ancient Chinese board game known as "Go." We then adjust this density locally depending on the density of the background class nearby. We show the amount of the adjustments needed are approximately equal to the adaptive bandwidths of our choice, which gives additional computational savings. We name the resulting method LAGO for "locally adjusted Go-kernel" density estimator and apply LAGO to a real drug discovery dataset, comparing its performance with a number of existing methods.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2005 program

JSM 2005 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2005