ambujraj / SourceByte

An open-source boilerplate code for data engineering with pandas

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SourceByte

An open-source boilerplate code for data engineering with pandas

Setup

  1. Install dependencies from requirements.txt
  2. Create Data folder in root
  3. Add your AWS credentials in configuration/config.ini file
  4. Put your raw file in either S3 or data/raw/ folder
  5. Run from src/main.py

About

An open-source boilerplate code for data engineering with pandas


Languages

Language:Python 100.0%