Kafka

Kafka

Real time data management


What is Kafka?.

Apache Kafka is a distributed streaming platform capable of handling trillions of events a day. Initially conceived as a messaging queue, Kafka is based on an abstraction of a distributed commit log. As a streaming platform, Apache Kafka provides low-latency, high-throughput, fault-tolerant publish and subscribe pipelines and is able to process streams of events. Kafka provides reliable, millisecond responses to support both customer-facing applications and connecting downstream systems with real-time data.


Maintained by License Type Popular Examples Support Updates Developer Skills
Apache Kafka developer teams Apache License 2.0 kafka.apache.org Full Stack, Python or Scala
Often Compared to Testing Accessibility Maintained by Repository
Spark, rabbitmq Jmeter Apache Kafka developer teams github.com/apache/kafka



  Pros:
  • Batch Handling Capability (ETL like Functionality).
  • Message Broker Capabilities.
  • Low Latency.
  • Scalability
  Cons:
  • No Complete Set of Monitoring Tools.
  • Issues with Message Tweaking.
  • Behaves Clumsy.