JSM 2011 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Abstract Details

Activity Number: 189
Type: Contributed
Date/Time: Monday, August 1, 2011 : 10:30 AM to 12:20 PM
Sponsor: Section for Statistical Programmers and Analysts
Abstract - #301665
Title: There Is Such a Thing as Too Much Data
Author(s): Ying Su*+
Companies: Merck Research Laboratories
Address: , Upper Gwynedd, PA, 19454, United States
Keywords: SAS ; Pharmacoepidemiology ; Nested Case-Control Study ; Random Sampling
Abstract:

Pharmacoepidemiology studies increasingly use healthcare databases, which involve massive amounts of data. Sometimes conventional programming methods are unable to handle the data volume, or even overwhelm the computing resources. An example of selecting controls for a nested matched case-control study is used to illustrate this challenge. Different attempted approaches in SAS programming are discussed, and a solution is developed to overcome the limitation of computing space and speed. The solution presented also addresses the programming efficiency concerns, and ensures the statistical randomness in sampling. Lessons learned from this example will provide guidance for other studies with similar characteristics.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2011 program




2011 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.