Skip to content

conduktor/awesome-kafka

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Awesome Kafka Awesome

A curated list of awesome Apache Kafka resources, tools, libraries, and applications.

Last verified: December 31, 2025 · Legend: ⚠️ Inactive (no commits in 2+ years) · 📦 Archived

See also: awesome-kafka-connect and Kafka Security Controls to help with CWE, NIST 800-53, and PCI-DSS compliance.

Kafka

  • Apache Kafka - Distributed event streaming platform capable of handling trillions of events a day.
  • Kafka Documentation - Official documentation covering architecture, APIs, and operations.
  • Kafka Quickstart - Get up and running with Kafka in minutes.
  • KIPs - Kafka Improvement Proposals tracking upcoming features and changes.

Clients

Java

C/C++

  • librdkafka - High-performance C/C++ library, foundation for many language bindings.
  • cppkafka - Modern C++11 wrapper for librdkafka.

Python

  • confluent-kafka-python - High-performance client based on librdkafka with AsyncIO support.
  • aiokafka - Native asyncio client for Python async applications.
  • kafka-python - Pure Python client (not actively maintained).
  • Faust - Stream processing library, Python equivalent of Kafka Streams.

Go

  • franz-go - Feature-complete high-performance pure Go client.
  • kafka-go - Go-idiomatic Kafka library by Segment.
  • Sarama - Most popular Go client, pure Go implementation.
  • confluent-kafka-go - CGo wrapper around librdkafka with official Confluent support.

Rust

  • rust-rdkafka - Async futures-based client built on librdkafka.
  • kafka-rust - Pure Rust Kafka protocol implementation.

Node.js / TypeScript

.NET / C#

Ruby

  • rdkafka-ruby - Modern Ruby client based on librdkafka.
  • Karafka - Ruby and Rails framework for Kafka with Web UI.
  • ruby-kafka - Pure Ruby client (deprecated).

PHP

Scala

  • Alpakka Kafka - Reactive Streams connector for Akka Streams.
  • ZIO Kafka - ZIO-based Kafka client for functional Scala.
  • ⚠️ Greyhound - Wix's high-level Kafka SDK with ZIO, used in 1,500+ microservices.

Kotlin

  • kotlin-kafka - Kotlin Coroutines and Arrow integration for Kafka.

Elixir / Erlang

  • brod - Robust Erlang/Elixir client with consumer groups support.
  • Kaffe - Opinionated Elixir wrapper around brod.
  • KafkaEx - Pure Elixir Kafka client.

Swift

Clojure

  • Jackdaw - Comprehensive Kafka library for Clojure.
  • ⚠️ kafka.clj - Clojure wrapper with Kafka Streams support.

Haskell

HTTP / REST

Stream Processing

Kafka-Native

  • Kafka Streams - Client library for building stream processing applications.
  • ksqlDB - Event streaming database with SQL interface built on Kafka Streams.
  • Interactive Queries - Query state stores in Kafka Streams applications.
  • StreamT - Conduktor's dbt for streaming, transforming data in Kafka with SQL.

Frameworks

  • Apache Flink - Distributed stream processing framework with exactly-once semantics.
  • Apache Spark Streaming - Micro-batch stream processing on Spark.
  • Apache Samza - Stream processing framework with deep Kafka integration.
  • Apache Storm - Distributed real-time computation system.
  • Apache Beam - Unified batch and streaming API portable across runners.

Streaming Databases

  • RisingWave - PostgreSQL-compatible streaming database with materialized views.
  • Materialize - Streaming database with incremental view maintenance.
  • Apache Pinot - Real-time OLAP datastore with sub-second queries on Kafka topics.
  • Apache Druid - Distributed data store for real-time analytics with Kafka ingestion.
  • Timeplus Proton - Fast streaming SQL engine for real-time analytics.
  • DeltaStream - Serverless stream processing using Flink SQL (commercial).
  • StarRocks - MPP database with continuous Kafka routine load for lakehouse analytics.

Python Libraries

  • Faust - Stream processing library porting Kafka Streams to Python.
  • Quix Streams - Python library for high-volume time-series streaming.
  • Bytewax - Python stream processing with Rust performance.
  • Pathway - Python ETL framework with Rust engine for real-time processing.
  • kstreams - Lightweight Kafka Streams implementation for Python.

Other

  • Hazelcast - In-memory data grid with Jet stream processing engine.
  • Redpanda Connect - Declarative stream processor with 100+ connectors (formerly Benthos).

Kafka Connect

CDC (Change Data Capture)

  • Debezium - CDC platform for MySQL, PostgreSQL, MongoDB, SQL Server, Oracle, and more.

Databases

Cloud Storage

  • S3 Sink - Export data to Amazon S3 in Avro, JSON, or Parquet.

Data Warehouses

  • BigQuery Sink - Stream data to Google BigQuery with upsert support.
  • Snowflake Sink - Official Snowflake connector with Iceberg support.

Message Queues

IoT & Protocols

  • MQTT Connector - Community MQTT source and sink connector for brokers and devices.

File Transfer

HTTP & APIs

  • HTTP Sink - Send data to HTTP endpoints.
  • HTTP Source - Poll HTTP APIs as a Kafka source.
  • WebSocket Source - Stream messages from WebSocket endpoints into Kafka.
  • gRPC Source - Consume gRPC server-streaming endpoints directly into Kafka topics.

Observability

Tools & Utilities

Schema Registry

Implementations

Serialization

  • Apache Avro - Compact binary format with rich schema evolution support.
  • Protocol Buffers - Google's binary serialization with strong typing.
  • Buf - Modern Protobuf toolchain with linting and breaking change detection.

Management & Monitoring

Web UIs

  • Kafka UI - Free open-source UI for managing Kafka clusters, topics, and consumers.
  • Kafbat UI - Community fork of Kafka UI with active development.
  • Redpanda Console - Developer-friendly UI with time-travel debugging.
  • AKHQ - Kafka GUI for topics, consumer groups, Schema Registry, and Connect.
  • Kafdrop - Lightweight web UI for viewing Kafka topics and consumer groups.
  • ⚠️ CMAK - Cluster Manager for Apache Kafka by Yahoo.
  • Kouncil - Modern web interface with advanced message browsing.
  • Conduktor Console - Enterprise control plane for Kafka with access control (commercial).
  • Lenses - DataOps platform with SQL Studio and data policies (commercial).
  • kPow - Enterprise Kafka monitoring with RBAC and audit logging (commercial).
  • Confluent Control Center - Monitoring and management for Confluent Platform (commercial).

Metrics & Exporters

  • Burrow - LinkedIn's consumer lag checking and monitoring service.
  • Kafka Exporter - Prometheus exporter for Kafka broker and consumer group metrics.
  • JMX Exporter - Prometheus exporter for JMX metrics from Kafka brokers.
  • KMinion - Prometheus exporter for consumer lag and log directory sizes.
  • 📦 Kafka Lag Exporter - Consumer group latency exporter for Kubernetes.
  • Grafana Kafka Dashboards - Curated Grafana dashboards for Kafka broker, topic, and consumer metrics.

Cluster Management

  • Strimzi - Kubernetes operator for running Apache Kafka (CNCF Incubating).
  • Cruise Control - LinkedIn's automated workload rebalancer and self-healing.
  • Cruise Control UI - Web interface for Cruise Control operations.
  • MirrorMaker 2 - Built-in cross-cluster replication using Kafka Connect.
  • Jikkou - GitOps tool for managing Kafka resources as code.
  • Kafka-Kit - Datadog's tools for partition mapping, rebalancing, and auto-throttling.

Infrastructure as Code

Interactive Tools

CLI Tools

  • kcat - Swiss army knife for Kafka, formerly kafkacat.
  • kafkactl - Command-line tool inspired by kubectl with auto-completion.
  • kaf - Modern CLI for Kafka inspired by kubectl and docker.
  • Zoe - CLI for humans with time-based consumption and filtering.
  • ⚠️ kcli - Simple Kafka command line browser.
  • trubka - CLI tool for Kafka with Protobuf support.
  • topicctl - Tool for managing Kafka topics with YAML configs.
  • ⚠️ kafka-shell - Interactive shell for Apache Kafka.

Security

Authentication & Authorization

  • Strimzi OAuth - OAuth2/OIDC authentication for Kafka.
  • Apache Ranger - Fine-grained authorization and centralized policy management.
  • ⚠️ Kafka Security Manager - Git-based ACL management with auto-revert by Conduktor.
  • Julie - GitOps for Kafka RBAC and topologies.
  • ⚠️ kafka-gitops - Manage Kafka resources as code with Git.

Encryption

Governance

  • Apache Atlas - Metadata management and data lineage for Kafka.
  • DataHub - Modern data catalog with Kafka metadata and lineage support.
  • OpenMetadata - Open-source metadata platform with Kafka integration.

Proxies & Gateways

  • Conduktor Gateway - Kafka proxy with encryption, masking, and policy enforcement (commercial).
  • Kroxylicious - Java framework for building Kafka protocol proxies.
  • Gloo Gateway - Kubernetes-native API gateway with Kafka support.
  • Apache APISIX - API gateway with kafka-proxy plugin.

Testing & Development

Local Development

  • Kafka Docker - Docker Compose for Kafka ecosystem by Conduktor.
  • cp-all-in-one - Confluent Platform Docker Compose files.
  • kafka-local - Minimal Docker setup for local Kafka development.
  • Redpanda - Kafka-compatible broker ideal for local development (no JVM).

Testing Frameworks

Data Generation

  • Kafka Connect Datagen - Generate mock data based on Avro schemas.
  • ⚠️ Voluble - Realistic relational data generator for Kafka.
  • 📦 Datagen - Multi-format data generator by Materialize.
  • Mockingbird - Mock streaming data generator.
  • DataFaker - Java library for generating realistic fake data.

Benchmarking

Chaos Engineering

IDE Plugins

Observability & Tracing

OpenTelemetry

Collectors & Pipelines

Tracing Backends

  • Jaeger - Open-source distributed tracing platform.
  • Zipkin - Distributed tracing system for gathering timing data.
  • Grafana Tempo - High-scale distributed tracing backend.

Kafka on Kubernetes

Operators

  • Strimzi - CNCF Incubating Kubernetes operator for Apache Kafka.
  • Confluent for Kubernetes - Official Confluent operator for deploying Confluent Platform.
  • 📦 Koperator - Banzai Cloud's Kubernetes operator for Kafka (formerly Kafka Operator).

Helm Charts

Resources

Use Cases & Case Studies

Enterprise Scale

Architecture Patterns

Best Practices & Patterns

Event Sourcing & CQRS

Exactly-Once Semantics

Partitioning Strategies

Transactional Outbox Pattern

Schema Evolution

Troubleshooting & Performance

Consumer Lag

Performance Tuning

Debugging & Troubleshooting

Internals & Architecture

KRaft (ZooKeeper Replacement)

Replication & Consistency

Log Compaction

How Kafka Works

Disaster Recovery

Multi-Datacenter Patterns

Replication Tools

Migration Guides

ZooKeeper to KRaft

Cross-Cluster Replication

AI/ML Integration

Feature Stores

ML Pipelines

Online Learning

Data Lakehouse Integration

Tableflow & Iceberg

Data Warehouse Integration

Notable KIPs

KIP-500: ZooKeeper Removal

KIP-1150: Diskless Topics

Other Notable KIPs

Kafka-Compatible Alternatives

  • Redpanda - C++ implementation with Kafka API compatibility and 10x lower latency.
  • WarpStream - Diskless Kafka on object storage, acquired by Confluent (commercial).
  • AutoMQ - Cloud-native Kafka with shared storage for 50-90% cost reduction.
  • Bufstream - Kafka-compatible broker with Apache Iceberg integration (commercial).

Related Message Brokers

Managed Services

  • Confluent Cloud - Fully managed Kafka with Kora engine and stream processing.
  • AWS MSK - Amazon Managed Streaming for Apache Kafka with serverless option.
  • Azure Event Hubs - Kafka protocol compatible event streaming on Azure.
  • Aiven for Apache Kafka - Multi-cloud managed Kafka with open-source focus.
  • Instaclustr - Managed open-source Kafka with strong compliance.
  • Redpanda Cloud - Managed Redpanda with BYOC and serverless options.

Learning Resources

Courses

Documentation & Tutorials

Videos & Podcasts

Community

Reference

Conferences & Events

Kafka Summit / Current

Flink Forward

Meetups

Newsletters & Community

Newsletters

Podcasts

Community Resources

Books


Contributing

Contributions are welcome! Please read the contribution guidelines first.

License

CC0

About

A curated list of awesome Apache Kafka resources, tools, libraries, and applications

Resources

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published