Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance.
docker-compose.yml files for cp-all-in-one, cp-all-in-one-community, cp-all-in-one-cloud, Apache Kafka Confluent Platform
Kafka-based Job Queue for Python
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Real-Time Financial Market Data Processing and Prediction application
Command Line Interface for the Strimzi Kafka Operator
The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce fake JSON streaming datasets and push them to an Apache Kafka topic.
Astronomy Broker based on Apache Spark
Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
This repository focuses on gathering and maintaining a curated list of resources to learn Hadoop for free.
Open Source Computer Vision with TensorFlow, MiniFi, Apache NiFi, OpenCV, Apache Tika and Python. For processing images from IoT devices like Raspberry Pis, NVidia Jetson TX1, NanoPi Duos and more, which are equipped with attached cameras or external USB webcams, we use Python to interface via OpenCV and PiCamera. From there we run image processin…
Explore Apache Kafka data pipelines in Kubernetes.
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python
Counting Tweets Per User in Real-Time
Distributed Streaming with Apache Kafka and Python OpenCV
Fully Managed Apache Kafka Cluster with Ansible & Terraform.
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit, and uses Docker Compose to easily spin up the required services in Docker containers.
Security Analytics Engine - Anomaly Detection in Web Traffic