Power of data. Simplicity of design. Speed of innovation.
Apache® Spark™ is an open-source cluster computing framework with in-memory processing to speed analytic applications up to 100 times faster compared to technologies on the market today. Developed in the AMPLab at UC Berkeley, Apache Spark can help reduce data interaction complexity, increase processing speed and enhance mission-critical applications with deep intelligence.
Highly versatile in many environments, Apache Spark is known for its ease of use in creating algorithms that harness insight from complex data. Spark was elevated to a top-level Apache Project in 2014 and continues to expand today.
IBM is committing to the Apache Spark project, with investments in design-led innovation and broad-scale education programs to promote open source innovation and accelerate intelligence into every application.
As part of its commitment to Apache Spark, IBM will:
- Open source its breakthrough IBM SystemML machine learning technology and collaborate with Databricks to advance machine learning at the core of the Apache Spark project
- Offer IBM Analytics for Apache Spark™ on IBM Bluemix
- Open a Spark Technology Center in San Francisco for the Data Science and Developer Community
- Educate one million data scientists and data engineers on Apache Spark through extensive partnerships with AMPLab, DataCamp, MetiStream, Galvanize, and Big Data University MOOC