|
Activity Number:
|
198
|
|
Type:
|
Contributed
|
|
Date/Time:
|
Monday, August 7, 2006 : 2:00 PM to 3:50 PM
|
|
Sponsor:
|
Section on Statistical Graphics
|
| Abstract - #306759 |
|
Title:
|
Letter Value Box Plots: Box Plots for Large Datasets
|
|
Author(s):
|
Karen Kafadar*+ and Heike Hofmann and Hadley Wickham
|
|
Companies:
|
University of Colorado and Iowa State University and Iowa State University
|
|
Address:
|
Department of Mathematics, Denver, CO, 80217-3364,
|
|
Keywords:
|
boxplot ; distribution ; shape
|
|
Abstract:
|
Conventional boxplots are useful displays for conveying rough information about the data distributional shape. Tail information beyond the quartiles (whiskers) can be unreliable in small samples so whiskers show only the extent of the outer quartiles. Thus, boxplots present two shortcomings for large data sets: more "outliers" are labeled and information about tail behavior is sparse. Letter value box plots address both shortcomings: a fixed, reasonably small, number of outliers can be labeled, and more detailed estimates of tail behavior based on letter values beyond the quartiles can be shown. We describe their construction and illustrate their usefulness on real data sets.
|