JSM 2005 - Toronto

Abstract #304273

This is the preliminary program for the 2005 Joint Statistical Meetings in Minneapolis, Minnesota. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2005); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.



The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


The Program has labeled the meeting rooms with "letters" preceding the name of the room, designating in which facility the room is located:

Minneapolis Convention Center = “MCC” Hilton Minneapolis Hotel = “H” Hyatt Regency Minneapolis = “HY”

Back to main JSM 2005 Program page



Legend: = Applied Session, = Theme Session, = Presenter
Activity Number: 94
Type: Topic Contributed
Date/Time: Monday, August 8, 2005 : 8:30 AM to 10:20 AM
Sponsor: Section on Nonparametric Statistics
Abstract - #304273
Title: Crossvalidated and Bagged Partitioning Estimators with Variable Importance
Author(s): Annette Molinaro*+
Companies: NCI/Yale University
Address: 6120 Executive Blvd MSC 7244, Rockville, MD, 20852, United States
Keywords: variable importance ; piecewise constant estimation ; prediction ; high-dimensional data ; genomics ; bagging
Abstract:

Clinicians aim toward a more preventive model of attacking cancer by pinpointing and targeting specific early events in disease development. These early events can be measured as genomic, proteomic, epidemiologic, and clinical variables. Such measurements are then used to predict clinical outcomes such as primary occurrence or mortality. Recursive partitioning algorithms seek to explain the individual contributions of various covariates as well as their interactions for the purposes of predicting outcomes. There are several important considerations when using such algorithms. The first is to not overfit the data; the second is the stability of the resulting predictor. For example, CART is sensitive to data fluctuations and, given a perturbation, will potentially build a different predictor than that built on the original data. A third consideration is variable importance. We present crossvalidated and bagged estimators using both CART and DSA-Part algorithms for a proteomic dataset. We explore the benefits of the suggested bagging scheme and introduce a data adaptive ranking for assessing each covariate's contribution.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2005 program

JSM 2005 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2005