AbdullahEmad22/realtime-data-engineering-project
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for seamless deployment and scalability.
Stars
3
Forks
1
Language
Python
License
MIT
Last pushed
Apr 03, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/AbdullahEmad22/realtime-data-engineering-project"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
dagucloud/dagu
A local-first workflow engine built the way it should be: declarative, file-based,...
risesoft-y9/DataFlow-Engine
数据流引擎是一款面向数据集成、数据同步、数据交换、数据共享、任务配置、任务调度的底层数据驱动引擎。数据流引擎采用管执分离、多流层、插件库等体系应对大规模数据任务、数据高频上报、数据高频采集、异构...
insitro/redun
Yet another redundant workflow engine
hyparam/icebird
Icebird: JavaScript Iceberg Client
cnstlungu/portable-data-stack-dagster
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and...