airscholar / e2e-data-engineering

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
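To make the flow of ingestion, processing, and storage concrete, here is a minimal sketch that uses only the Python standard library. It is not the repository's code. A `deque` stands in for a Kafka topic, a generator stands in for the Spark job, and a `dict` stands in for a Cassandra table. The function names, the record fields (`name`, `email`), and the validation rule are all hypothetical, chosen only for illustration. In the real stack, Airflow would schedule the ingestion step and each stand-in would be replaced by the corresponding service running in Docker.

```python
import json
import uuid
from collections import deque

# In-memory stand-ins (assumptions, not the repository's code):
#   topic -> a Kafka topic, table -> a Cassandra table keyed by primary key.


def ingest(raw_records, topic):
    """Ingestion: serialize each record to JSON bytes and enqueue it,
    roughly what a producer task scheduled by Airflow might do."""
    for record in raw_records:
        topic.append(json.dumps(record).encode("utf-8"))


def process(topic):
    """Processing: decode, validate, and normalize each message,
    standing in for a Spark job that reads from the topic."""
    while topic:
        event = json.loads(topic.popleft().decode("utf-8"))
        if not event.get("email"):
            continue  # hypothetical rule: drop records without an email
        email = event["email"].lower()
        yield {
            # Deterministic key, so reprocessing the same email overwrites the row.
            "id": str(uuid.uuid5(uuid.NAMESPACE_DNS, email)),
            "name": event.get("name", "").strip().title(),
            "email": email,
        }


def store(rows, table):
    """Storage: upsert each row by primary key, as a Cassandra write would."""
    for row in rows:
        table[row["id"]] = row


def run_pipeline(raw_records):
    """Run the three stages in order and return the resulting table."""
    topic = deque()
    table = {}
    ingest(raw_records, topic)
    store(process(topic), table)
    return table


if __name__ == "__main__":
    sample = [
        {"name": " ada lovelace ", "email": "Ada@Example.com"},
        {"name": "no email"},
    ]
    for row in run_pipeline(sample).values():
        print(row)
```

Keeping each stage behind its own function mirrors the separation the pipeline gets from running each component as its own service: any stage can be swapped for the real client without changing the others.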

Home Page: https://www.youtube.com/watch?v=GqAcTrqKcrY
