Serge-Ntumba / stream_data_with_kafka_docker_airflow_spark

A complete data pipeline, from data extraction to storage, using a combination of tools for specific purposes: Python for data retrieval from API, Airflow for scheduling task, Kafka for data streaming, Spark for data processing, and Cassandra for data storage.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Serge-Ntumba/stream_data_with_kafka_docker_airflow_spark Stargazers