SupR: Multithreaded and Distributed R for Big Data Analysis

At a high level, SupR is a R-style implementation of a computing system for Distributed Interactive Statistical Computing (DISC). As an initial effort, SupR is currently built from R, Version 3.1.1. The current focus has been on the implementation of
  1. a R-style front-end by maintaining the existing R syntax and internal basic data structures,
  2. a Java-like multithreading model, which would be the key to the success of big data analysis,
  3. a Spark-like cluster computing environment, and
  4. a builtin Simple Distributed File System, which, to some extent, represents a kind of cluster-wide namespace.


Recent Talks

Fundamental Work (deprecated/under construction)

Future Directions

Useful Links