Kafka development & monitoring with Kpow
This stack provides a complete Apache Kafka development and monitoring environment, built using Confluent Platform components and enhanced with Kpow for enterprise-grade observability and management. It includes a 3-node Kafka cluster, Zookeeper, Schema Registry, Kafka Connect, and Kpow itself.
📌 Description
This architecture is designed for developers and operations teams who need a robust, local Kafka ecosystem for building, testing, and managing Kafka-based applications. It features high availability (3 brokers), schema management for data governance, a data integration framework (Kafka Connect), and a powerful UI (Kpow) for monitoring cluster health, inspecting data, managing topics, and troubleshooting.
It's ideal for scenarios involving event-driven architectures, microservices communication, data integration pipelines, and situations requiring deep visibility into Kafka internals.
🔑 Key components
🚀 Kpow (Kafka management & monitoring toolkit)
- Container: `kpow` (`factorhouse/kpow:latest`, enterprise) or `kpow-ce` (`factorhouse/kpow-ce:latest`, community) - An engineering toolkit providing a rich web UI for comprehensive Kafka monitoring, management, and data inspection. Kpow gathers Kafka resource information, stores it locally in internal topics, and delivers custom telemetry and insights. Features include:
- Comprehensive Real-time Kafka Resource Monitoring:
- Instant visibility ("X-Ray vision") of brokers, topics, consumer groups, partitions, offsets, and more.
- Gathers data every minute, with a "Live mode" for real-time updates.
- No JMX access required.
- Advanced Consumer and Streams Monitoring (Compute Console):
- Visualize message throughput and lag for consumers (and soon, Kafka Streams topologies).
- Multi-dimensional consumer lag insights from custom derived telemetry.
- Ability to reset consumption at group, host, member, topic, or assignment level.
- Deep Data Inspection with kJQ:
- Powerful JQ-like querying (kJQ) to search tens of thousands of messages per second.
- Supports JSON, Apache Avro®, Transit, EDN, and Protobuf messages (with custom SerDes for Protobuf to JSON).
- Schema Registry Integration: Full support for controlling and monitoring Schema Registries.
- Kafka Connect Cluster Management: Full support for controlling and monitoring Kafka Connect clusters.
- Enterprise-Grade Security & Governance:
- Authentication: Supports DB, File, LDAP, SAML, or OpenID configurations.
- Authorization: Simple or Role-Based Access Controls (RBAC); RBAC is typically paired with JAAS and configured via volume mounts (Enterprise edition only).
- Data Policies: Includes capabilities for masking and redaction of sensitive data (e.g., PII, Credit Card).
- Audit Logging: All user actions are captured in the Kpow audit log.
- Key Integrations & Deployment Features:
- Slack Integration: Sends user actions to an operations channel.
- Prometheus Endpoints: For integration with preferred metrics and alerting systems.
- HTTPS Support: Easily configured with your own certificates or via a reverse-proxy.
- Multi-Cluster Monitoring: Manage multiple Kafka clusters from a single Kpow installation.
- Air-Gapped Environments: Well-suited due to all data being stored in local Kafka topics.
- Exposes the UI at `http://localhost:3000`.
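Kpow's lag telemetry ultimately reduces to offset arithmetic gathered from the brokers (no JMX required): per partition, lag is the log-end offset minus the group's committed offset, and group lag is the sum across partitions. A minimal sketch of that arithmetic with illustrative numbers (not Kpow's actual implementation):

```shell
#!/bin/sh
# Consumer lag per partition = log-end offset - committed offset.
# Offsets below are made up; Kpow gathers the real values every minute.
end_offsets="120 95 100"   # latest offset per partition (p0 p1 p2)
committed="110 95 80"      # the consumer group's committed offsets

total=0
i=0
set -- $committed
for end in $end_offsets; do
  lag=$((end - $1))            # per-partition lag
  echo "partition $i lag: $lag"
  total=$((total + lag))       # group lag is the sum
  i=$((i + 1))
  shift
done
echo "total group lag: $total"
```

This prints a per-partition breakdown and a group total (here 10, 0, 20 and a total of 30); Kpow's "multi-dimensional" views slice the same numbers by group, host, member, topic, and assignment.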
🧠 Kafka cluster (3 brokers + Zookeeper)
- Zookeeper (`confluentinc/cp-zookeeper:7.8.0`): Coordinates the Kafka brokers.
- Kafka Brokers (`confluentinc/cp-kafka:7.8.0` x3): Form the core message bus.
  - Configured with distinct internal (`1909x`) and external (`909x`) listeners for Docker networking.
  - Provides basic fault tolerance with 3 nodes.
  - Accessible externally via ports `9092`, `9093`, `9094`.
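With the cluster up, the listeners can be smoke-tested directly. The commands below assume broker containers named `kafka-1`..`kafka-3` and an internal listener on port `19092` (adjust both to match your compose file):

```shell
# Create a topic replicated across all three brokers
docker exec kafka-1 kafka-topics --bootstrap-server localhost:19092 \
  --create --topic smoke-test --partitions 3 --replication-factor 3

# Produce one message
echo 'hello kpow' | docker exec -i kafka-1 kafka-console-producer \
  --bootstrap-server localhost:19092 --topic smoke-test

# Consume it back
docker exec kafka-1 kafka-console-consumer --bootstrap-server localhost:19092 \
  --topic smoke-test --from-beginning --max-messages 1
```

Host-side clients would instead use the external listeners (`localhost:9092`, `:9093`, `:9094`).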
📜 Schema Registry (`confluentinc/cp-schema-registry:7.8.0`)
- Manages schemas (Avro, Protobuf, JSON Schema) for Kafka topics, ensuring data consistency and enabling schema evolution.
- Accessible at `http://localhost:8081`.
- Configured with Basic Authentication (`schema_jaas.conf`).
- Stores its schemas within the Kafka cluster itself.
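The registry's REST API can be exercised from the host. The subject name is illustrative, and the `admin:admin` credentials stand in for whatever `schema_jaas.conf` actually defines:

```shell
# Register an Avro schema for the value side of a hypothetical "users" topic
curl -u admin:admin -X POST \
  -H "Content-Type: application/vnd.schemaregistry.v1+json" \
  --data '{"schema": "{\"type\":\"record\",\"name\":\"User\",\"fields\":[{\"name\":\"id\",\"type\":\"long\"}]}"}' \
  http://localhost:8081/subjects/users-value/versions

# List all registered subjects
curl -u admin:admin http://localhost:8081/subjects
```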
🔌 Kafka Connect (`confluentinc/cp-kafka-connect:7.8.0`)
- Framework for reliably streaming data between Apache Kafka and other systems.
- Accessible via REST API at `http://localhost:8083`.
- Configured to use JSON converters by default.
- Custom Connector Support: Volume mount `./resources/kpow/connector` allows adding custom or third-party connectors (e.g., JDBC, S3, Iceberg - hinted at by `AWS_REGION`).
- Manages its configuration, offsets, and status in dedicated Kafka topics.
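Connectors are managed entirely over the REST API. As one example, the FileStreamSource connector that ships with Kafka can be created as follows (the connector name, file path, and topic are illustrative):

```shell
# List installed connector plugins and currently running connectors
curl http://localhost:8083/connector-plugins
curl http://localhost:8083/connectors

# Create a simple file source connector
curl -X POST -H "Content-Type: application/json" http://localhost:8083/connectors \
  --data '{
    "name": "demo-file-source",
    "config": {
      "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
      "tasks.max": "1",
      "file": "/tmp/demo.txt",
      "topic": "demo-file"
    }
  }'

# Check the connector and task status
curl http://localhost:8083/connectors/demo-file-source/status
```

The same operations are available through the Kpow UI.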
🧰 Use cases
Local Kafka development & testing
- Build and test Kafka producers, consumers, and Kafka Streams applications against a realistic, multi-broker cluster.
- Validate application behavior during broker failures (by stopping/starting broker containers).
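One way to run such a failure drill, again assuming container names like `kafka-1`/`kafka-2` and an internal listener on port `19092`:

```shell
# Stop one broker and watch leadership move to the surviving replicas
docker stop kafka-2

# Describe a topic you created earlier: its ISR should shrink from 3 to 2
docker exec kafka-1 kafka-topics --bootstrap-server localhost:19092 \
  --describe --topic my-topic

# Restart the broker and confirm the replica rejoins the ISR
docker start kafka-2
```

Kpow's broker and topic views show the same ISR changes as they happen.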
Data integration pipelines
- Utilize Kafka Connect to ingest data into Kafka from databases, logs, or APIs using source connectors.
- Stream data out of Kafka to data lakes (S3, Iceberg), warehouses, or other systems using sink connectors.
- Test and manage connector configurations via the Connect REST API or Kpow UI.
Schema management & evolution
- Define, register, and evolve schemas using Schema Registry to enforce data contracts between producers and consumers.
- Test compatibility modes and prevent breaking changes in data pipelines.
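Compatibility modes are set per subject (or globally) through the registry's `/config` endpoint; the subject name and credentials below are illustrative:

```shell
# Require new schemas to be backward compatible for this subject
curl -u admin:admin -X PUT \
  -H "Content-Type: application/vnd.schemaregistry.v1+json" \
  --data '{"compatibility": "BACKWARD"}' \
  http://localhost:8081/config/users-value

# Check a candidate schema (adds an optional field) against the latest version
curl -u admin:admin -X POST \
  -H "Content-Type: application/vnd.schemaregistry.v1+json" \
  --data '{"schema": "{\"type\":\"record\",\"name\":\"User\",\"fields\":[{\"name\":\"id\",\"type\":\"long\"},{\"name\":\"email\",\"type\":[\"null\",\"string\"],\"default\":null}]}"}' \
  http://localhost:8081/compatibility/subjects/users-value/versions/latest
```

Adding a field with a default is backward compatible, so the second call should report `{"is_compatible":true}`.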
Real-time monitoring & operations simulation
- Use Kpow to monitor cluster health, track topic/partition metrics (size, throughput), identify consumer lag, and inspect messages in real-time.
- Understand Kafka performance characteristics and troubleshoot issues within a controlled environment.
Learning & exploring Kafka
- Provides a self-contained environment to learn Kafka concepts, experiment with configurations, and explore the capabilities of the Confluent Platform and Kpow.