harrisonhenri / sparkify-etl

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Sparkify ETL

Welcome to Sparkify ETL!!

About this Project

"The objective of this project is to exercise ETL and Data-warehouse concepts as part of my Data Engineer Nanodegree". The project consists of an ETL pipeline to build a Data-warehouse in AWS following the schemas bellow:

Why?

This project is part of my personal portfolio, so, I'll be happy if you could provide me any feedback about the project, code, structure or anything that you can report that could make me a better developer!

Email-me: harrisonhenrisn@gmail.com

Connect with me at [LinkedIn](https://linkedin.com/in/harrison-henri-dos-santos-nascimento

Local installation and configuration

To install and run the project locally, first, add a .env file (based on the .env.example)!

  1. Install the python (minimum version of the project.toml at least)

  2. Install the dependencies with:

pip install poetry
python -m venv venv
poetry install
  1. Run with:
jupyter notebook

Built with

Redshift S3



About


Languages

Language:Jupyter Notebook 74.4%Language:Python 25.6%