JSM 2011 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Abstract Details

Activity Number: 667
Type: Contributed
Date/Time: Thursday, August 4, 2011 : 10:30 AM to 12:20 PM
Sponsor: Biopharmaceutical Section
Abstract - #301059
Title: Variable Selection in Large P, Small N Problems with Applications to Tailored Therapeutics
Author(s): Wei-Yin Loh*+
Companies: University of Wisconsin
Address: Department of Statistics, Madison, WI, 53706, United States
Keywords: classification and regression trees; nonparametric regression; machine learning; random forest
Abstract:

Classification and regression problems in which the number of predictor variables, p, exceeds the number of observations, n, are increasingly common because of rapid technological advances in data collection. Because traditional solutions usually require p to be less than n, new approaches to these problems are needed. Two methods that have been proposed are random forest (Breiman, Machine Learning 2001) and EARTH (Doksum, Tang and Tsui, JASA 2008). This talk presents a method based on the GUIDE classification and regression tree algorithm (Loh, Statist. Sinica 2002; Ann. Appl. Statist. 2009). Its performance is compared with that of random forest and EARTH on simulated and real data.


Address information is given for authors with a + after their name.
Presenting authors have a * after their name.
