R programming on big data

Use familiar tools and run R workloads at unprecedented scale

Increase productivity of R users

  • Use familiar tools and libraries from the R ecosystem
  • Benefit from seamless integration with RStudio Server
  • Quickly parallelize R jobs with SparkR and sparklyr

Simplify access to large data sets

  • Quickly tap into datasets from various sources
  • Easily explore data with familiar tools and interfaces
  • Clean and enrich data faster with optimized formats

Enable R workloads at scale

  • Get results 10-100x faster than Apache Spark™
  • Auto-scale resources for the most demanding jobs
  • Lower costs with efficient cluster management