The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.
Abstract Details
Activity Number:
|
189
|
Type:
|
Contributed
|
Date/Time:
|
Monday, August 1, 2011 : 10:30 AM to 12:20 PM
|
Sponsor:
|
Section for Statistical Programmers and Analysts
|
Abstract - #301665 |
Title:
|
There Is Such a Thing as Too Much Data
|
Author(s):
|
Ying Su*+
|
Companies:
|
Merck Research Laboratories
|
Address:
|
, Upper Gwynedd, PA, 19454, United States
|
Keywords:
|
SAS ;
Pharmacoepidemiology ;
Nested Case-Control Study ;
Random Sampling
|
Abstract:
|
Pharmacoepidemiology studies increasingly use healthcare databases, which involve massive amounts of data. Sometimes conventional programming methods are unable to handle the data volume, or even overwhelm the computing resources. An example of selecting controls for a nested matched case-control study is used to illustrate this challenge. Different attempted approaches in SAS programming are discussed, and a solution is developed to overcome the limitation of computing space and speed. The solution presented also addresses the programming efficiency concerns, and ensures the statistical randomness in sampling. Lessons learned from this example will provide guidance for other studies with similar characteristics.
|
The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.
Back to the full JSM 2011 program
|
2011 JSM Online Program Home
For information, contact jsm@amstat.org or phone (888) 231-3473.
If you have questions about the Continuing Education program, please contact the Education Department.