ismaildawoodjee / aws-data-pipeline

A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):

Home Page:http://54.169.163.221:8080

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ismaildawoodjee/aws-data-pipeline Stargazers