Spark

Spark

Open source parallel processing framework


What is Spark?.

Apache Spark is an open source parallel processing framework for running large-scale data analytics applications across clustered computers. It can handle both batch and real-time analytics and data processing workloads.


Maintained by License Type Popular Examples Support Updates Developer Skills
Spoddutur Apache License 2.0 spark.apache.org
Often Compared to Testing Accessibility Maintained by Repository
Hadoop , Kafka Scala Spoddutur github.com/apache/spark



  Pros:
  • High data processing speed.
  • In-Memory Computation.
  Cons:
  • No Support for Real-time Processing.
  • No File Management System.