JSM 2005 Online Program

Abstract #303418

This is the preliminary program for the 2005 Joint Statistical Meetings in Minneapolis, Minnesota. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 7-10, 2005); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.

The Program has labeled the meeting rooms with "letters" preceding the name of the room, designating in which facility the room is located:

Minneapolis Convention Center = “MCC” Hilton Minneapolis Hotel = “H” Hyatt Regency Minneapolis = “HY”

Back to main JSM 2005 Program page

Legend:

= Applied Session,

= Theme Session,

= Presenter

Activity Number:	34
Type:	Contributed
Date/Time:	Sunday, August 7, 2005 : 2:00 PM to 3:50 PM
Sponsor:	Biopharmaceutical Section
Abstract - #303418
Title:	Generating Forests of Tree-based Models by Permuting the Model-building Process
Author(s):	Bret Musser*+
Companies:	Merck Research Laboratories
Address:	126 E Lincoln Ave, Rahway, NJ, 07065, United States
Keywords:	recursive partitioning ; model building ; gene expression ; tree-based models
Abstract:	In any model-building process, relationships between predictor variables cause complications. In "feature selection" approaches such as subset regression and recursive partitioning (RP), correlated predictors may be mutually substitutable, may make additive contributions, or may show synergy. As with stepwise regression methods, these features are rarely discovered in recursive partitioning models as most RP methods generate a single tree as "the" answer. As in subset regression, there is a solution. This is to produce not a single tree, but a "forest" of models that fit the data. Harvesting knowledge from the forest is much harder than generating it, however. Not only may visually different trees be identical in the sense of generating identical rules, but different rules may lead to identical sets of terminal nodes given correlated predictors. The first case reflects purely topological issues; the second the possibility of alternative models. The methods of this talk focus on permuting the model-building process without changing the input data. The methods are applied to understanding the relationship between DNA microarray data and outcome of breast cancer.

The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2005 program