JSM 2004 - Toronto

Abstract #300062

This is the preliminary program for the 2004 Joint Statistical Meetings in Toronto, Canada. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2004); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2004 Program page



Activity Number: 96
Type: Invited
Date/Time: Monday, August 9, 2004 : 10:30 AM to 12:20 PM
Sponsor: SSC
Abstract - #300062
Title: Averaging Methods for High-throughput Screening Data
Author(s): Yuanyuan (Marcia) Wang*+
Companies: University of Waterloo
Address: 200 Elm St., Toronto, ON, M5T 1K4, Canada
Keywords: HTS ; averaging methods ; KNN ; CART ; bias and variance trade-off
Abstract:

In drug discovery, high-throughput screening (HTS) is used to assay large numbers of compounds against a biological target. A research pharmaceutical company might have of the order 2 million compounds available, and the human genome project is generating many new targets. Hence, there is a need for a more efficient strategy: smart or virtual screening. In smart screening, a representative sample (experimental design) of compounds is selected from a collection and assayed against a target. A model is built relating activity to explanatory variables describing compound structure. Our previous work shows that local methods, like K-nearest neighbors (KNN) and classification and regression trees (CART), perform very well. I will propose a method of averaging across multiple classifiers based on building classifiers on subspaces (subsets of variables). It improves the performance of KNN and CART for HTS data. Some interpretation of the method has also been considered.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2004 program

JSM 2004 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2004