cltj / pipeline

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Demo of a simple pipeline

This project simulates a simple data pipeline

Through automations it aims to efficiently move data through the pipeline, exploring and cleaning it for each step of the way



The project demontrates the following:

  • How to transport, clean and store data in an efficient way using various libraries and services
  • How to inspect, explore and analyze data in a usefull maner
  • How to normalize, optimize and prepare data for further conusmption of an analytics tool
  • How to apply business logic on the dataset
  • How to identify insights in the dataset
  • How to create content for end user reporting from insights
  • How to create content for end user presenting from insights

About


Languages

Language:Python 56.4%Language:Jupyter Notebook 43.6%