Apache Spark on IBM Power

Apache Spark is an in-memory distributed compute engine that can analyze large-scale data up to 100x faster than disk-based technologies such as Apache Hadoop for certain workloads.

Vendor

IBM

Company Website

Product details

Apache Spark is an open-source cluster computing framework optimized for fast, large-scale data processing.

Originally developed in the AMPLab at UC Berkeley, Apache Spark can help reduce the complexity of working with data, increase processing speed, and enhance mission-critical applications with deep intelligence.

Benefits

  • **Innovate faster:** Apache Spark delivers up to 100x the performance of Apache Hadoop for certain workloads because of its advanced in-memory computing engine.
  • **Accelerate app development:** Apache Spark's Streaming and SQL programming models, backed by MLlib and GraphX, make it easier to build apps that exploit machine learning and graph analytics.
  • **Optimize with open technologies:** The OpenPOWER Foundation enables GPU, CAPI Flash, RDMA, and FPGA acceleration, along with machine learning innovation, to optimize performance for Apache Spark workloads.