Harduim / Data-Modeling-Postgres

Data Modeling with Postgres | Udacity Project 01

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data Modeling with Postgres | Udacity Project 01

Main goals of this project:

  • To Define fact and dimension tables for a star schema for a particular analytic focus
  • Build an ETL pipeline that transfers data local files into these tables in Postgres using Python and SQL.

Setting up the environment:

Requirements:

Database

  • Navigate to the project folder
  • Create the postgres container: docker-compose up
  • Create the sparkifydb database and tables: python3 create_tables.py

Conda

  • Navigate to the project folder
  • Create the environment: conda env create -f environment.yml
  • Activate the new environment: conda activate rubrics

ETL

  • Run the ETL pipeline: python3 etl.py

About

Data Modeling with Postgres | Udacity Project 01


Languages

Language:Jupyter Notebook 60.8%Language:Python 39.2%