Local Cluster Microsoft HPC Microsoft Azure Burst LSF
30 / 35
Distributed Environments
Data Partition
BIG DATA
Data Partition
... Data Partition
Compute Node
Compute Node
Master Node
... Compute Node
31 / 35
RevoScaleR ComputeContext
One Line of Code for all supported architectures Defines Hardware Handles Distribution, Monitoring, and Failover via native job scheduler
32 / 35
Performance GLM ‘Gamma’ Simulation Timings
Independent Variables: 2 factors (100 and 20 levels) and one continuous
Computation Time (seconds)
80 70 60 50
Revolution R Enterprise / Parallel performance scales linearly with data size
40 30 20 10 .5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
Data Size (millions of rows)
Timings from a Windows 7, 64-bit quadcore laptop with 8 GB RAM Open Source Revolution R Enterprise 33 / 35
Summary RevoScaleR provides Fast and efficient ways to process Big Data: Import Explore Manipulate Visualize Analyze
34 / 35
Thank you Revolution Analytics is the leading commercial provider of software and support for the popular open source R statistics language. www.revolutionanalytics.com, 1.855.GET.REVO, Twitter: @RevolutionR