Tafkas / luigi-sf-bay-area-bike-share

A luigi pipeline to import SF Bay Area Bike Share data into Postgres

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Luigi San Francisco Data Pipeline

This repository contains code to import the anonymized bike trip data from August 2013 to August 2015 provided by Kaggle.

Getting Started

  • Download the data from the Kaggle website and put into data.
  • Install Python dependencies with Pipenv: pipenv install
  • Create the Postgres Database with make create-postgres-db
  • Run the luigi daemon with luigid
  • Run the pipeline with python import_sfbike.py

About

A luigi pipeline to import SF Bay Area Bike Share data into Postgres


Languages

Language:Python 87.8%Language:Makefile 12.2%