Open source parallel processing framework

What is Spark?.

Apache Spark is an open source parallel processing framework for running large-scale data analytics applications across clustered computers. It can handle both batch and real-time analytics and data processing workloads.

Maintained by License Type Popular Examples Support Updates Developer Skills
Spoddutur Apache License 2.0
Often Compared to Testing Accessibility Maintained by Repository
Hadoop , Kafka Scala Spoddutur

  • High data processing speed.
  • In-Memory Computation.
  • No Support for Real-time Processing.
  • No File Management System.