SupR: Multithreaded and Distributed R for Big Data Analysis
At a high level, SupR is a R-style implementation of a computing system for
Distributed Interactive Statistical Computing (DISC).
As an initial effort, SupR is currently built from
R (version 3.1.1) system source code.
The current focus has been on the implementation of
a R-style front-end by maintaining the existing R syntax and internal
basic data structures,
a Java-like multithreading model,
which would be the key to the success of big data analysis,
a Spark-like cluster computing environment, and
a builtin Simple Distributed File System, which, to some extent, represents a kind of cluster-wide namespace.