wlopezm-unal / reddit_project_airflow_aws

This project implements an ETL pipeline with Apache Airflow: it extracts data from Reddit, transforms it, and loads the result into an AWS S3 bucket. Airflow orchestrates the workflow so that each step of the ETL process runs reliably and repeatably.
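The extract, transform, and load stages described above can be sketched as plain Python functions. This is a minimal illustration under stated assumptions, not the repository's actual code: the function names, the sample records, and the chosen fields (`id`, `title`, `score`, `created_utc`) are all hypothetical, the extraction step returns hard-coded data instead of calling Reddit, and the load step only builds the CSV text that would be uploaded.

```python
import csv
import io
from datetime import datetime, timezone


def extract_posts():
    """Stand-in for the Reddit extraction step; returns hard-coded sample posts."""
    # Hypothetical records: a real run would query Reddit here instead.
    return [
        {"id": "a1", "title": "  Airflow tips ", "score": "42", "created_utc": 1700000000},
        {"id": "b2", "title": "S3 pricing", "score": "7", "created_utc": 1700003600},
    ]


def transform_posts(posts):
    """Trim titles, cast scores to int, and convert epoch seconds to ISO 8601 (UTC)."""
    cleaned = []
    for post in posts:
        created = datetime.fromtimestamp(post["created_utc"], tz=timezone.utc)
        cleaned.append({
            "id": post["id"],
            "title": post["title"].strip(),
            "score": int(post["score"]),
            "created_utc": created.isoformat(),
        })
    return cleaned


def to_csv(rows):
    """Serialize rows to CSV text, the payload a load step could upload to S3."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["id", "title", "score", "created_utc"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def run_pipeline():
    """Run extract, transform, and serialize in order, as a scheduler would."""
    return to_csv(transform_posts(extract_posts()))


if __name__ == "__main__":
    print(run_pipeline())
```

In an Airflow deployment, each function would typically become its own task (for example via `PythonOperator`) chained in this order, and the CSV payload would be written to the bucket with an S3 client such as boto3. Keeping the stages as separate functions is what lets the scheduler retry or rerun one step without repeating the others.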

Repository from GitHub: https://github.com/wlopezm-unal/reddit_project_airflow_aws

reddit_project_DE


Languages

Python 75.1%
HCL 19.3%
Dockerfile 5.6%