|
Activity Number:
|
304
|
|
Type:
|
Topic Contributed
|
|
Date/Time:
|
Tuesday, August 5, 2008 : 2:00 PM to 3:50 PM
|
|
Sponsor:
|
Section on Statistical Computing
|
| Abstract - #300863 |
|
Title:
|
BigmemoRy: Handling Massive Data in R
|
|
Author(s):
|
Michael Kane*+ and John W. Emerson
|
|
Companies:
|
Yale University and Yale University
|
|
Address:
|
25 Avalon Drive Apt 2424, Milford, CT, 06460,
|
|
Keywords:
|
R ; data ; memory ; C
|
|
Abstract:
|
Multi-gigabyte data sets challenge and frustrate R users even on well-equipped hardware. C programming can provide memory efficiency and speed improvements, but is cumbersome for interactive data analysis and lacks the flexibility and power of R's rich statistical programming environment. The new package, bigmemoRy, bridges this gap, implementing persistent massive objects in memory (managed in R but implemented in C) and supporting the manipulation and exploration of these objects. This talk will show bigmemoRy in action with data from the Netflix Prize competition. This talk should be followed by the talk by John Emerson.
|
- The address information is for the authors that have a + after their name.
- Authors who are presenting talks have a * after their name.
Back to the full JSM 2008 program |