JSM 2012 Home

JSM 2012 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Online Program Home

Abstract Details

Activity Number: 267
Type: Invited
Date/Time: Tuesday, July 31, 2012 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Computing
Abstract - #303887
Title: Integrating R with New Algorithms, Data Structures, and Data Sources
Author(s): Duncan Temple Lang*+
Companies: University of California at Davis
Address: Statistics Department, Davis, CA, 95618,
Keywords: R ; compiler ; large data ; extensible ; large data ; block algorithms
Abstract:

I'll talk about 2 approaches we are exploring to allow R to better handle large volumes of data faster and more flexibly. The first is an experiment in how to make an R-based interpreter extensible so that developers of packages can introduce new first-class data types/structures that are tightly integrated with the interpreter and existing core code. This allows us to introduce alternative implementations of common data structures where the data are structured in very different ways (e.g. streaming data) or even remotely (e.g. out of memory).

The second approach analyzes entire R "scripts" and attempts to determine the flow and lifetimes of data objects. It uses this information to try to improve the organization and reuse of memory. It also uses LLVM (Low Level Virtual Machine) to compile simple R functions into machine code that can exploit context-specific information. These can generate fast code that cumulate information by working on individual or sub-blocks of observations as opposed to all-data-in-memory algorithms.

The talk is intended to stimulate discussion of future systems and how we might "get there from here".


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2012 program




2012 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.